Enhancement of Effective Spatial Data Analysis using R
|
|
- Sylvia Gilmore
- 5 years ago
- Views:
Transcription
1 Indian Journal of Science and Technology, Vol 9(21), DOI: /ijst/2016/v9i21/95149, June 2016 ISSN (Print) : ISSN (Online) : Enhancement of Effective Spatial Data Analysis using R S. Palaniappan 1*, T. V. Rajinikanth 2 and A. Govardhan 3 1 Department of CSE, Saveetha University, Chennai , Tamil Nadu, India; s.palani.in@gmail.com 2 Department of CSE, SNIST, Hyderabad , Andhra Pradesh, India; rajinitv@gmail.com 3 Department of SIT, JNTUH, Kukatpally, Hyderabad , Andhra Pradesh, India; govardhan_cse@yahoo.co.in Abstract Background: The availability of Spatial Data which is a part of GIS is growing day by day in an exponential manner. This high availability of data is throwing challenges to the research community to analyze and draw effective conclusions. The Present Study aims at requirement for effective analysis and to draw Conclusions. Methods/Statistical Analysis: Spatial Analysis requires logical relationships between attribute data and map features.spatial data Analysis is not a simple single task it requires complex procedures in which combinational techniques namely Hybrid techniques are required for effective analysis. Mathematics and statistics are the fundamentals to spatial data analytics. In this paper, a realistic Spatialcrime data set was considered for analysis. It involves different types of data mining Techniques like Clustering, Classification and Association rule mining techniques apart from Hybrid techniques. These hybrid Data mining Techniques were applied using R. Findings: The Hybrid Data Mining techniques with K-means Clustering and J48 Decision Tree Algorithm was developed and Applied for the enhancement of accuracy. Association Rule generation Apriori algorithm was applied on the resultant K-means clustered data set. The application of 3D visualization techniques also made for further analysis. Applications/Improvements: It is essentially required to analyze these complex spatial data sets effectively. So there is a need of hybrid Spatial Data Mining Techniquesrequirement for effective analysis and to draw Conclusions. Keywords: Complex Spatial Data Sets, Effective Spatial Data Analysis, Hybrid Data Mining Techniques, Spatial Data Analysis, 3D Visualization Techniques 1. Introduction Spatial Analysis or Spatial Statistics 1 are the techniques which are used to study properties of topological, geometrical or geographical objects. It is used to analyze the spatial data. The types of Spatial analysis includes Spatial data Analysis, Spatial Auto correlation, Interpolation, Regression, Interaction and Simulation and Modeling. Spatial analysis 2 is the process by which we turn raw data into useful information. It is a set of techniques for analyzing spatial data. It is the analysis of Geographical data sets. Shapefiles are the vector data and composed of a few required files a).shp the shape format file b).shx the shape index file c).dbf the attribute file. R language was best suitable choice for spatial data analysis. Statistical inference deals with drawing of conclusions based on data and pre assumptions. A Spatial Data Mining is nothing but application of traditional Data Mining (DM) methods to spatial data. SDM is the process of discovering required relationships and patterns that may exist implicitly in spatial databases 3. Spatial data mining 4-6 plays an important role in extraction of interesting patterns and finding relationships between spatial and non spatial data. 2. Literature Survey A Spatial Data set 7 is the geographical information that is used to represents the features and boundaries of a location on earth. It is stored in the form of co-ordinates and topology with mapping of data. GIS systems are used to access, analyze and manipulate the Spatial Data sets. The example of Spatial data 8 are the satellite images. A *Author for correspondence
2 Enhancement of Effective Spatial Data Analysis using R spatial database 7 is a database is used to optimize and store data apart from query objects data defined in a geometric space. Normally these are used to represent simple geometric objects such as points, lines and polygons. Spatial Data Mining (SDM) 8 techniques are useful to extract potentially useful knowledge. It is used to find hidden patterns, relations between spatial data and/or non-spatial data. Spatial relationships have great importance in the analysis process 9,10. The traditional Data mining Techniques are not suitable to spatial data as they do not capable to handle data belong to location. So it is required to look for new methods to address spatial relationships and spatial data. Spatial relationships measuring is a time taken process as it generates large volume of data when geometric location is encoded. To query and perform spatial analytical tasks GIS software s are useful. GIS are not much useful in this regard. So it is necessary to integrate the available methods with the spatial data mining methods 15, K-Means Cluster Algorithm K-means clustering 17,18 is a DM algorithm used to cluster the observations into groups of similar observations without having any previous knowledge about those relationships. It is also called simple K-means algorithm. It is applied to the domains like namely medical imaging, biometrics, Spatial, Spatio-temporal etc. 2.2 Working of k Means Algorithm 1. Put K points into the space denoted by the objects to be clustered. 2. These points denote initial group of Centroids. Assign every object to the closest Centroid group. 3. Recalculate the positions of the K Centroids after every object has been assigned. 4. Repeat the above two Steps 2 and 3 until there is no change in Centroids. 2.3 Decision Tree Itis used as a predictive model which maps observations ofan item to conclusions of that item s target value. There are 91 classification types. In this the class labels are represented by leaf nodesand branch nodesdenotes features that directs to these class labels. J48 algorithm is an advancedversions of C4.5 algorithms. The output of this J48 algorithm results a Decision tree. 3. Proposed Approach The Sacramento crime January 2006 file 19 contains 7,584 crime records, as made available by the Sacramento Police Department, California. This was posted by Spatial Key 20 The crime spatial data set was collected for Spatial/Non-Spatial analysis. It was initially subjected to K-means clustering techniques and after that the resultant data set was subjected to J48 Decision Tree classification which then again subjected to Association Data mining technique called Apriori. The desired rules were generated after this hybridization is allowed for further analysis. 4. Implementation of Proposed Approach The Crime Data set 20 was subjected to pre-process and lot of pre-processed techniques like data cleaning, Noise removal and inconsistency etc. were applied and then bring it in to the desired format for subjecting to various Hybrid Data Mining techniques. The Pre-processed data set is split in to Training Data set and testing data set with ratio of 70% and 30%. Initially the K-means clustering technique was applied on the pre-processed Training data set and then tested with Testing data set. Fix the number of clusters initial input as 5. The number of clusters is arrived as 5 through statistical results / observations. The spatial data set consists of 12 attributes namely FIR.No., Date, Time, Address, District, Beat, Grid, Ucr_nic_code, Crime, CN, Latitude and Longitude. The 5 Clusters are shown in the following Table.1. CLUSTER 0:The district 6 has associated with Beat 6C and Missing Person & Towed/Stored Vehicle as major Crime with Crime number 0 apart from other thefts as Petty Theft, Hit/Run. The district 6 is associated with Latitude and Longitude values as Centroids. The time is 11.00AM as its Centroid with date as 1/23/2006 and address Centroid as 1 ScrippsDr. CLUSTER 1:The district 2 has associated with Beat 2B and Petty Theft/Inside as major Crime with Crime number 7000 apart from other thefts as Auto Theft, Burglary Residence. The district 2 is associated with Latitude , Longitude values as Centroids. The time is 12.00PM as its Centroid with date as 1/23/2006 and address Centroid as 1689 Ardenway. CLUSTER 2: The district 3 has associated with Beat 3M and Grand Theft as major Crime with Crime number 487(A) PC apart from other thefts as Auto/Vehicle 2 Indian Journal of Science and Technology
3 S. Palaniappan, T. V. Rajinikanth and A. Govardhan Theft. The district 5 is associated with Latitude , Longitude values as Centroids. The time is 8.00AM as its Centroid with date as 1/24/2006 and address Centroid as 170 Shrike Cir. CLUSTER 3: The district 4 has associated with Beat 4A and Take Vehicle W/O Owner as major Crime with Crime number 10851(A) apart from other thefts as Burglary / Theft vehicle, Battery NoncohabSpous. The district 4 is associated with Latitude , Longitude values as Centroids. The time is 10.00AM as its Centroid with date as 1/23/2006 and address Centroid as 1040 Florin Rd. The Ucr_ncic_code is CLUSTER 4: The district 4 has associated with Beat 4B and Towed / Stored Vehicle as major Crime with Crime number apart from other thefts as Burglary / Theft vehicle. The district 4 is associated with Latitude , Longitude values as Centroids. The time is 7.57AM as its Centroid with date as 1/24/2006 and address Centroid as 1342 Palomar Cir. The percentage of data instances associated with Cluster0 is 18%, Cluster1 is 17%, Cluster2 is 30%, Cluster3 is 20% and Cluster 4 is 15%. After this the resultant clustered data set was subjected to hybridization using Decision Tree algorithm called J48 and the results were analyzed effectively for bringing conclusions. The resultant clustered and classified data set was subjected to Association rule mining technique namely Apriori and the final results were analyzed for effective conclusions. 5. Results and Analysis All the clusters are showing that Majority numbers of crimes are Burglary, Theft of Auto/Vehicle, Petty Thefts, Battery NoncohabSpous, Grand Theft and Hit & Run. These kinds of Thefts are associated with all the districts. The Percentage of instances associated with Clusters shows that Cluster 2 has highest percentage followed by Cluster3. The lowest percentage of instances is associated with Cluster4. From Various Decision trees it is inferred that the districts are associated with beat numbers based on the values of Latitude, Longitude and Grid values. One of the decision tree is shown in Figure 1. District 1 is associated with beat number 1A, 1B and 1C. District 2 is majorly associated with beat numbers 2A, 2B, 2C respectively. District 3 is majorly associated with beat numbers 3A, 3B, 3C and 3M. District 4 is majorly associated with beat numbers 4A, 4B and 4C. The district 5 is majorly associated with beat numbers 5A, 5B and 5C respectively. The district 6 is associated with 6A, 6B and 6C.The Figure 2 shows the graphical relationship of districts numbers with beat numbers in which Districts numbers are taken along X-axis and Beat numbers along Y-axis. More number of beats are there in district number 3 followed by 4 and the remaining districts has 3 beats equally. Figure 4 shows 3D Graph for 6 Districts with Latitude and Longitude in which Latitude was taken along X-axis, Longitude was taken along Y-axis and Districts along Z-axis. The Association Algorithm Apriori was applied and found desired rules with various Support and Confidence values. The number of rules generated is shown in Table 2. The numbers of rules are changing based on Support and Confidence values. The Majority of the rules when considered for across different values of Support and confidence are summarized and shown below. R1: ucr_ncic_code=2404 & crime= TAKE VEH W/O OWNER && CN=10851(A)VC & Date=1/23/2006 => Cluster=cluster3 R2: ucr_ncic_code=7000&date= &CN= &Crime= TOWED / STORED VEHICLE=>Cluster=cluster1& Cluster=cluster4 R3: district=6 & Date= & CN=0, Crime = MISSING PERSON/ TOWED / STORED VEHICLE=> Cluster=cluster0 R4: district=2 &district=3 & district=4 &Date= => Cluster=cluster2 One of the 3D Visualization graph of the Association rules are shown in the Figure 3 6. Figures and Tables Figure 1. The Decision tree of Crime Data set. Indian Journal of Science and Technology 3
4 Enhancement of Effective Spatial Data Analysis using R Table 1. Clustered Crime Data Set for 5 clusters Attribute Cluster# Full Data (175.0) 0 (46.0) 1 (65.0) 2 (11.0) 3 (24.0) 4(29.0) FIR No. F3756 F3759 F3756 F3767 F3763 F3757 Date 1/23/2006 1/23/2006 1/23/2006 1/24/2006 1/23/2006 1/24/2006 Time 12:00 PM 11:00AM 12:00PM 8:00AM 10:00AM 7:57 AM Address 1689 Ardenway 1 ScrippsDr 1689 Ardenway 170 Shrike Cir 1040 Florin Rd 1342 Palomar Cir District Beat 6C 6C 2B 3M 4A 4B Grid Ucr_ncic_code Crime TSV MP / TSV PTI GT TVWO TSV CN PC 487(A) PC 10851(A) Latitude Longitude Table 2. Number of Rules generation with different Support and Confidence S.No. Support Confidence Number of Rules Figure 4. 3D Visualization of Association Rules. 7. Conclusion Figure 2. Graph Showing Districts numbers on X-axis and Beat Numbers on Y-axis. Figure 3. 3D Graph for 6 Districts with Latitude and Longitude. The Hybrid Data Mining techniques with K-means Clustering and J48 Decision Tree Algorithm was developed and Applied for the enhancement of accuracy. Association Rule generation Apriori algorithm was applied on the resultant K-means clustered data set. Beat number is associated with type of crime and where as district number is in turn associated with Beat number and Ucr_ncic_code7000. The Decision Tree is also showing that the beat numbers are associated with District numbers based on Latitude, Longitude and Grid attribute values. All the clusters are showing that Majority numbers of crimes are Burglary, Theft of Auto/Vehicle, Petty Thefts, BatteryNoncohabSpous, Grand Theft and Hit & Run. These kinds of Thefts are associated with all the districts. The same is confirmed by the Association rules thus generated. The Cluster3 is associated with 4 Indian Journal of Science and Technology
5 S. Palaniappan, T. V. Rajinikanth and A. Govardhan District 4, beat number 4A and the associated majority crime is Vehicle Theft which in turn associated with crime Number 10851(A)VC dated 1/23/2006 and ucr_ ncic_code=2404. The Cluster 1 & 4 are associated with Districts 2 & 4 with Beat numbers 2B & 4B and the associated major crime is TOWED / STORED VEHICLE which in turn with Crime number dated 1/23/2006 and ucr_ncic_code=7000. Cluster 0 is associated with District 6, beat number 6C and the associated majority crime is MISSING PERSON & TOWED / STORED VEHICLE which in turn associated with Crime Number 0 dated 1/23/2006 and ucr_ncic_code=7000. Districts 2,3 and 4 associated with date 1/23/2006 for the Cluster2. 8. Acknowledgement Thanks to the Spatial Key ( com/) organization in providing the Spatial Crime Data Set as open to researchers to do explorative analysis. 9. References 1. Spatial analysis. Available from: wiki/spatial_analysis. Date accessed: 05/10/ Spatial Data Analysis. Available from: edu/gis/chapter14_notes.pdf. Date accessed: 05/10/ Hemalatha M, Naga Saranya N. A Recent Survey on Knowledge Discovery in Spatial Data Mining. IJCI International Journal of Computer Science May; 8(3): Rao T, Rajasekhar N, Rajinikanth TV. An efficient approach for Weather forecasting using Support Vector Machines International Conference on Intelligent Network and Computing. 2012; 47. p RajiniKanth TV, Anuradha K, Premchand P, Murali Krishna IV. Weather Data Analysis of Rajasthan State using Data Mining Techniques. Journal of Advanced. 2011Apr; 3(2): Data mining. Available from: wiki/data_mining. Date accessed: 07/10/ Spatial Data base. Available from: wiki/spatial_database. Date accessed: 07/10/ Introduction to Spatial Databases. Available from: Date accessed: 07/10/ Rajanikanth J, Rajinikanth TV, Prasad TVKP, Radha Krishna B. Analysis on Spatial Data Clustering Methods - A Case Study Oct- Dec; 3(4): Fayyad UMJ, Piatetsky-Shapiro G, Smyth P. From Data Mining to Knowledge Discovery: An Overview. Advances in Knowledge Discovery and Data Mining, AAAI Press, Menlo Park. 1996; Shekhar S, Zhang P, Huang Y, Vatsavai R. Trend in Spatail Data Mining, as a chapter to appear in Data Mining: Next Generation Challenges and Future Directions Apr- Jun; 4(2): Vijay Kumar A, RajiniKanth TV. Estimation of the Influence of Fertilizer Nutrients Consumption on the Wheat Crop yield in India- a Data mining Approach Dec; 3(2): Vijay Kumar A, RajiniKanth TV. A Data Mining Approach for the Estimation of Climate Change on the Jowar Crop Yield in India Dec; 2(2): Vijay Kumar A, RajiniKanth TV. Estimation of the Influential Factors of rice yield in India. 2nd International Conference on Advanced Computing methodologies Aug; Gueting RH. An Introduction to Spatial Database Systems, Special Issue on Spatial Database Systems of the VLDB Journal Oct; 3(4): Koperski K, Adhikary J, Han J. Knowledge Discovery in Spatial Databases: Progress and Challenges. Proceedings of SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery, Technical Report 96-08, University of British Columbia, Vancouver, Canada, Zhang P, Steinbach M, Kumar V, Shekhar S, Tan P, Klooster S, Potter C. Discovery of Patterns of Earth Science Data using Data Mining, as a chapter to appear in Next Generation of Data Mining Applications, IEEE Press. 2004; Parimala M, Lopez D, Kaspar S. K-Neighbourhood Structural Similarity Approach for Spatial Clustering. Indian Journal of Science and Technology Sep; 8(23): Rajinikanth TV, Balaram VVSSS, Rajasekhar N. Analysis of Indian temperature data using Data mining Techniques. International Conference Advances in Computing and Information Technology May. p Spatial Key Support. Available from: Date accessed: 09/10/2015. Indian Journal of Science and Technology 5
International Journal of Research in Computer and Communication Technology, Vol 3, Issue 7, July
Hybrid SVM Data mining Techniques for Weather Data Analysis of Krishna District of Andhra Region N.Rajasekhar 1, Dr. T. V. Rajini Kanth 2 1 (Assistant Professor, Department of Computer Science & Engineering,
More informationAlgorithms for Characterization and Trend Detection in Spatial Databases
Published in Proceedings of 4th International Conference on Knowledge Discovery and Data Mining (KDD-98) Algorithms for Characterization and Trend Detection in Spatial Databases Martin Ester, Alexander
More informationExploring Spatial Relationships for Knowledge Discovery in Spatial Data
2009 International Conference on Computer Engineering and Applications IPCSIT vol.2 (2011) (2011) IACSIT Press, Singapore Exploring Spatial Relationships for Knowledge Discovery in Spatial Norazwin Buang
More informationInternational Journal of Advance Engineering and Research Development. Review Paper On Weather Forecast Using cloud Computing Technique
Scientific Journal of Impact Factor (SJIF): 4.72 International Journal of Advance Engineering and Research Development Volume 4, Issue 12, December -2017 e-issn (O): 2348-4470 p-issn (P): 2348-6406 Review
More informationSoft Spatial Query Processing in Spatial Databases-A Case Study
International Journal of Engineering Research and Development e-issn: 2278-067X, p-issn: 2278-800X, www.ijerd.com Volume 10, Issue 4 (April 2014), PP.47-52 Soft Spatial Query Processing in Spatial Databases-A
More informationDEVELOPMENT OF KNOWLEDGE MINING TECHNIQUES FOR SPATIAL DECISION SUPPORT SYSTEM
International Journal of Computer Engineering & Technology (IJCET) Volume 8, Issue 6, Nov-Dec 2017, pp. 95 105, Article ID: IJCET_08_06_012 Available online at http://www.iaeme.com/ijcet/issues.asp?jtype=ijcet&vtype=8&itype=6
More informationAnalysis of Data Mining Techniques for Weather Prediction
ISSN (Print) : 0974-6846 ISSN (Online) : 0974-5645 Indian Journal of Science and Technology, Vol 9(38), DOI: 10.17485/ijst/2016/v9i38/101962, October 2016 Analysis of Data Mining Techniques for Weather
More informationPredictive Analytics on Accident Data Using Rule Based and Discriminative Classifiers
Advances in Computational Sciences and Technology ISSN 0973-6107 Volume 10, Number 3 (2017) pp. 461-469 Research India Publications http://www.ripublication.com Predictive Analytics on Accident Data Using
More informationMining Climate Data. Michael Steinbach Vipin Kumar University of Minnesota /AHPCRC
Mining Climate Data Michael Steinbach Vipin Kumar University of Minnesota /AHPCRC Collaborators: G. Karypis, S. Shekhar (University of Minnesota/AHPCRC) V. Chadola, S. Iyer, G. Simon, P. Zhang (UM/AHPCRC)
More informationFinding Multiple Outliers from Multidimensional Data using Multiple Regression
Finding Multiple Outliers from Multidimensional Data using Multiple Regression L. Sunitha 1*, Dr M. Bal Raju 2 1* CSE Research Scholar, J N T U Hyderabad, Telangana (India) 2 Professor and principal, K
More informationChicago Crime Category Classification (CCCC)
Chicago Crime Category Classification (CCCC) Sahil Agarwal 1, Shalini Kedlaya 2 and Ujjwal Gulecha 3 Abstract It is very important and crucial to keep a city safe from crime. If the police could be given
More informationDECISION MAKING IN AGRICULTURE BASED ON LAND SUITABILITY SPATIAL DATA ANALYSIS APPROACH
DECISION MAKING IN AGRICULTURE BASED ON LAND SUITABILITY SPATIAL DATA ANALYSIS APPROACH 1 M. PARIMALA, 2 DAPHNE LOPEZ 1 Asst Prof., School of Information Technology & Engineeirng, VIT University 2 Prof.,
More information43400 Serdang Selangor, Malaysia Serdang Selangor, Malaysia 4
An Extended ID3 Decision Tree Algorithm for Spatial Data Imas Sukaesih Sitanggang # 1, Razali Yaakob #2, Norwati Mustapha #3, Ahmad Ainuddin B Nuruddin *4 # Faculty of Computer Science and Information
More informationOutline. Geographic Information Analysis & Spatial Data. Spatial Analysis is a Key Term. Lecture #1
Geographic Information Analysis & Spatial Data Lecture #1 Outline Introduction Spatial Data Types: Objects vs. Fields Scale of Attribute Measures GIS and Spatial Analysis Spatial Analysis is a Key Term
More informationInternational Journal of Computer Science Trends and Technology (IJCST) Volume 3 Issue 3, May-June 2015
RESEARCH ARTICLE OPEN ACCESS Analysis of Meteorological Data in of Tamil Nadu Districts Based On K- Means Clustering Algorithm M. Mayilvaganan [1], P. Vanitha [2] Department of Computer Science [2], PSG
More informationINSTITUTE OF AERONAUTICAL ENGINEERING (Autonomous) Dundigal, Hyderabad
INSTITUTE OF AERONAUTICAL ENGINEERING (Autonomous) Dundigal, Hyderabad -00 0 CIVIL ENGINEERING TUTORIAL QUESTION BANK Course Name : Remote Sensing and GIS Course Code : A00 Class : IV B. Tech I Semester
More informationIntroduction to GIS. Dr. M.S. Ganesh Prasad
Introduction to GIS Dr. M.S. Ganesh Prasad Department of Civil Engineering The National Institute of Engineering, MYSORE ganeshprasad.nie@gmail.com 9449153758 Geographic Information System (GIS) Information
More informationSpatial Co-location Patterns Mining
Spatial Co-location Patterns Mining Ruhi Nehri Dept. of Computer Science and Engineering. Government College of Engineering, Aurangabad, Maharashtra, India. Meghana Nagori Dept. of Computer Science and
More informationINTRODUCTION TO GEOGRAPHIC INFORMATION SYSTEM By Reshma H. Patil
INTRODUCTION TO GEOGRAPHIC INFORMATION SYSTEM By Reshma H. Patil ABSTRACT:- The geographical information system (GIS) is Computer system for capturing, storing, querying analyzing, and displaying geospatial
More informationAutomatic Classification of Location Contexts with Decision Trees
Automatic Classification of Location Contexts with Decision Trees Maribel Yasmina Santos and Adriano Moreira Department of Information Systems, University of Minho, Campus de Azurém, 4800-058 Guimarães,
More informationMichael Harrigan Office hours: Fridays 2:00-4:00pm Holden Hall
Announcement New Teaching Assistant Michael Harrigan Office hours: Fridays 2:00-4:00pm Holden Hall 209 Email: michael.harrigan@ttu.edu Guofeng Cao, Texas Tech GIST4302/5302, Lecture 2: Review of Map Projection
More informationModels to carry out inference vs. Models to mimic (spatio-temporal) systems 5/5/15
Models to carry out inference vs. Models to mimic (spatio-temporal) systems 5/5/15 Ring-Shaped Hotspot Detection: A Summary of Results, IEEE ICDM 2014 (w/ E. Eftelioglu et al.) Where is a crime source?
More information1. Introduction. S.S. Patil 1, Sachidananda 1, U.B. Angadi 2, and D.K. Prabhuraj 3
Cloud Publications International Journal of Advanced Remote Sensing and GIS 2014, Volume 3, Issue 1, pp. 525-531, Article ID Tech-249 ISSN 2320-0243 Research Article Open Access Machine Learning Technique
More informationTheory, Concepts and Terminology
GIS Workshop: Theory, Concepts and Terminology 1 Theory, Concepts and Terminology Suggestion: Have Maptitude with a map open on computer so that we can refer to it for specific menu and interface items.
More informationDATA SOURCES AND INPUT IN GIS. By Prof. A. Balasubramanian Centre for Advanced Studies in Earth Science, University of Mysore, Mysore
DATA SOURCES AND INPUT IN GIS By Prof. A. Balasubramanian Centre for Advanced Studies in Earth Science, University of Mysore, Mysore 1 1. GIS stands for 'Geographic Information System'. It is a computer-based
More informationGeospatial data pre-processing on watershed datasets: A GIS approach
Edith Cowan University Research Online ECU Publications Post 2013 2014 Geospatial data pre-processing on watershed datasets: A GIS approach Sreedhar Nallan Edith Cowan University, acharyasree@gmail.com
More informationIntroduction to Geographic Information Systems Dr. Arun K Saraf Department of Earth Sciences Indian Institute of Technology, Roorkee
Introduction to Geographic Information Systems Dr. Arun K Saraf Department of Earth Sciences Indian Institute of Technology, Roorkee Lecture 01 What is Geographic Information Systems? I welcome to this
More informationEasySDM: A Spatial Data Mining Platform
EasySDM: A Spatial Data Mining Platform (User Manual) Authors: Amine Abdaoui and Mohamed Ala Al Chikha, Students at the National Computing Engineering School. Algiers. June 2013. 1. Overview EasySDM is
More informationA Geo-Statistical Approach for Crime hot spot Prediction
A Geo-Statistical Approach for Crime hot spot Prediction Sumanta Das 1 Malini Roy Choudhury 2 Abstract Crime hot spot prediction is a challenging task in present time. Effective models are needed which
More informationCSE 5243 INTRO. TO DATA MINING
CSE 5243 INTRO. TO DATA MINING Data & Data Preprocessing & Classification (Basic Concepts) Huan Sun, CSE@The Ohio State University Slides adapted from UIUC CS412, Fall 2017, by Prof. Jiawei Han Chapter
More informationANN based techniques for prediction of wind speed of 67 sites of India
ANN based techniques for prediction of wind speed of 67 sites of India Paper presentation in Conference on Large Scale Grid Integration of Renewable Energy in India Authors: Parul Arora Prof. B.K Panigrahi
More informationData Mining Classification: Basic Concepts and Techniques. Lecture Notes for Chapter 3. Introduction to Data Mining, 2nd Edition
Data Mining Classification: Basic Concepts and Techniques Lecture Notes for Chapter 3 by Tan, Steinbach, Karpatne, Kumar 1 Classification: Definition Given a collection of records (training set ) Each
More informationSPATIAL DATA MINING. Ms. S. Malathi, Lecturer in Computer Applications, KGiSL - IIM
SPATIAL DATA MINING Ms. S. Malathi, Lecturer in Computer Applications, KGiSL - IIM INTRODUCTION The main difference between data mining in relational DBS and in spatial DBS is that attributes of the neighbors
More informationUsing Image Moment Invariants to Distinguish Classes of Geographical Shapes
Using Image Moment Invariants to Distinguish Classes of Geographical Shapes J. F. Conley, I. J. Turton, M. N. Gahegan Pennsylvania State University Department of Geography 30 Walker Building University
More informationInternational Journal of Intellectual Advancements and Research in Engineering Computations
ISSN:2348-2079 Volume-5 Issue-2 International Journal of Intellectual Advancements and Research in Engineering Computations Agricultural land investigation and change detection in Coimbatore district by
More informationDecision Tree Learning
Decision Tree Learning Berlin Chen Department of Computer Science & Information Engineering National Taiwan Normal University References: 1. Machine Learning, Chapter 3 2. Data Mining: Concepts, Models,
More informationFeature Extraction Using Zernike Moments
Feature Extraction Using Zernike Moments P. Bhaskara Rao Department of Electronics and Communication Engineering ST.Peter's Engineeing college,hyderabad,andhra Pradesh,India D.Vara Prasad Department of
More informationRoad Network Impedance Factor Modelling Based on Slope and Curvature of the Road
Cloud Publications International Journal of Advanced Remote Sensing and GIS 2017, Volume 6, Issue 1, pp. 2274-2280 ISSN 2320 0243, Crossref: 10.23953/cloud.ijarsg.289 Methodology Article Road Network Impedance
More informationSpatial Statistical Analysis of burglary Crime in Chennai City Promoters Apartments: A CASE STUDY
Spatial Statistical Analysis of burglary Crime in Chennai City Promoters Apartments: A CASE STUDY M.Vijaya Kumar Assistant Professor K.S.R.College of Engineering Tiruchengode Dr.C.Chandrasekar Associate
More informationSpatial Analysis I. Spatial data analysis Spatial analysis and inference
Spatial Analysis I Spatial data analysis Spatial analysis and inference Roadmap Outline: What is spatial analysis? Spatial Joins Step 1: Analysis of attributes Step 2: Preparing for analyses: working with
More informationGIS (GEOGRAPHIC INFORMATION SYSTEMS)
GIS (GEOGRAPHIC INFORMATION SYSTEMS) 1 1. DEFINITION SYSTEM Any organised assembly of resources and procedures united and regulated by interaction or interdependence to complete a set of specific functions.
More informationMining Knowledge in Geographical Data. Krzysztof Koperski Jiawei Han Junas Adhikary. spatial access methods.
Mining Knowledge in Geographical Data Krzysztof Koperski Jiawei Han Junas Adhikary Huge amounts of data have been stored in databases, data warehouses, geographic information systems, and other information
More informationBasics of GIS. by Basudeb Bhatta. Computer Aided Design Centre Department of Computer Science and Engineering Jadavpur University
Basics of GIS by Basudeb Bhatta Computer Aided Design Centre Department of Computer Science and Engineering Jadavpur University e-governance Training Programme Conducted by National Institute of Electronics
More informationApplication of Clustering to Earth Science Data: Progress and Challenges
Application of Clustering to Earth Science Data: Progress and Challenges Michael Steinbach Shyam Boriah Vipin Kumar University of Minnesota Pang-Ning Tan Michigan State University Christopher Potter NASA
More informationWeb Visualization of Geo-Spatial Data using SVG and VRML/X3D
Web Visualization of Geo-Spatial Data using SVG and VRML/X3D Jianghui Ying Falls Church, VA 22043, USA jying@vt.edu Denis Gračanin Blacksburg, VA 24061, USA gracanin@vt.edu Chang-Tien Lu Falls Church,
More informationGEOGRAPHIC INFORMATION SYSTEMS
GEOGRAPHIC INFORMATION SYSTEMS 4-H Round-Up Community Transitions Workshop Daniel Hanselka June 14, 2011 Goals of the Workshop Answer the question: What is GIS? Uses of GIS. Some of the Common Terminology
More informationAn Optimized Interestingness Hotspot Discovery Framework for Large Gridded Spatio-temporal Datasets
IEEE Big Data 2015 Big Data in Geosciences Workshop An Optimized Interestingness Hotspot Discovery Framework for Large Gridded Spatio-temporal Datasets Fatih Akdag and Christoph F. Eick Department of Computer
More informationMapcube and Mapview. Two Web-based Spatial Data Visualization and Mining Systems. C.T. Lu, Y. Kou, H. Wang Dept. of Computer Science Virginia Tech
Mapcube and Mapview Two Web-based Spatial Data Visualization and Mining Systems C.T. Lu, Y. Kou, H. Wang Dept. of Computer Science Virginia Tech S. Shekhar, P. Zhang, R. Liu Dept. of Computer Science University
More informationCaesar s Taxi Prediction Services
1 Caesar s Taxi Prediction Services Predicting NYC Taxi Fares, Trip Distance, and Activity Paul Jolly, Boxiao Pan, Varun Nambiar Abstract In this paper, we propose three models each predicting either taxi
More informationExploring the Patterns of Human Mobility Using Heterogeneous Traffic Trajectory Data
Exploring the Patterns of Human Mobility Using Heterogeneous Traffic Trajectory Data Jinzhong Wang April 13, 2016 The UBD Group Mobile and Social Computing Laboratory School of Software, Dalian University
More informationModified Entropy Measure for Detection of Association Rules Under Simpson's Paradox Context.
Modified Entropy Measure for Detection of Association Rules Under Simpson's Paradox Context. Murphy Choy Cally Claire Ong Michelle Cheong Abstract The rapid explosion in retail data calls for more effective
More informationAN INTERNATIONAL SOLAR IRRADIANCE DATA INGEST SYSTEM FOR FORECASTING SOLAR POWER AND AGRICULTURAL CROP YIELDS
AN INTERNATIONAL SOLAR IRRADIANCE DATA INGEST SYSTEM FOR FORECASTING SOLAR POWER AND AGRICULTURAL CROP YIELDS James Hall JHTech PO Box 877 Divide, CO 80814 Email: jameshall@jhtech.com Jeffrey Hall JHTech
More informationFirst Exam. Geographers Tools: Automated Map Making. Digitizing a Map. Digitized Map. Revising a Digitized Map 9/22/2016.
First Exam Monday, September 26, 206. Combination of multiple choice questions. Bring a #2 pencil with eraser. Based on class lectures and chapter. If you miss the exam, an essay style make up exam will
More informationCHAPTER-17. Decision Tree Induction
CHAPTER-17 Decision Tree Induction 17.1 Introduction 17.2 Attribute selection measure 17.3 Tree Pruning 17.4 Extracting Classification Rules from Decision Trees 17.5 Bayesian Classification 17.6 Bayes
More informationTo Predict Rain Fall in Desert Area of Rajasthan Using Data Mining Techniques
To Predict Rain Fall in Desert Area of Rajasthan Using Data Mining Techniques Peeyush Vyas Asst. Professor, CE/IT Department of Vadodara Institute of Engineering, Vadodara Abstract: Weather forecasting
More informationPrediction of Seasonal Rainfall Data in India using Fuzzy Stochastic Modelling
Global Journal of Pure and Applied Mathematics. ISSN 0973-1768 Volume 13, Number 9 (2017), pp. 6167-6174 Research India Publications http://www.ripublication.com Prediction of Seasonal Rainfall Data in
More informationOrbital Insight Energy: Oil Storage v5.1 Methodologies & Data Documentation
Orbital Insight Energy: Oil Storage v5.1 Methodologies & Data Documentation Overview and Summary Orbital Insight Global Oil Storage leverages commercial satellite imagery, proprietary computer vision algorithms,
More informationOutline. Remote Sensing, GIS and DEM Applications for Flood Monitoring. Introduction. Satellites and their Sensors used for Flood Mapping
Outline Remote Sensing, GIS and DEM Applications for Flood Monitoring Prof. D. Nagesh Kumar Chairman, Centre for Earth Sciences Professor, Dept. of Civil Engg. Indian Institute of Science Bangalore 560
More informationLATUR TOWN ROADWAYS NETWORK ANALYSIS AND PLANNING BY USING GIS AND RS
ISSN 2320-9194 1 International Journal of Advance Research, IJOAR.org Volume 4, Issue 7, July 2016, Online: ISSN 2320-9194 LATUR TOWN ROADWAYS NETWORK ANALYSIS AND PLANNING BY USING GIS AND RS Mr. Dnyaneshwar
More informationLecture 2. A Review: Geographic Information Systems & ArcGIS Basics
Lecture 2 A Review: Geographic Information Systems & ArcGIS Basics GIS Overview Types of Maps Symbolization & Classification Map Elements GIS Data Models Coordinate Systems and Projections Scale Geodatabases
More informationOverview key concepts and terms (based on the textbook Chang 2006 and the practical manual)
Introduction Geo-information Science (GRS-10306) Overview key concepts and terms (based on the textbook 2006 and the practical manual) Introduction Chapter 1 Geographic information system (GIS) Geographically
More informationASSOCIATION RULE MINING AND CLASSIFIER APPROACH FOR QUANTITATIVE SPOT RAINFALL PREDICTION
ASSOCIATION RULE MINING AND CLASSIFIER APPROACH FOR QUANTITATIVE SPOT RAINFALL PREDICTION T.R. SIVARAMAKRISHNAN 1, S. MEGANATHAN 2 1 School of EEE, SASTRA University, Thanjavur, 613 401 2 Department of
More informationSpatial Data Science. Soumya K Ghosh
Workshop on Data Science and Machine Learning (DSML 17) ISI Kolkata, March 28-31, 2017 Spatial Data Science Soumya K Ghosh Professor Department of Computer Science and Engineering Indian Institute of Technology,
More informationGIST 4302/5302: Spatial Analysis and Modeling Lecture 2: Review of Map Projections and Intro to Spatial Analysis
GIST 4302/5302: Spatial Analysis and Modeling Lecture 2: Review of Map Projections and Intro to Spatial Analysis Guofeng Cao http://www.spatial.ttu.edu Department of Geosciences Texas Tech University guofeng.cao@ttu.edu
More informationGIST 4302/5302: Spatial Analysis and Modeling
GIST 4302/5302: Spatial Analysis and Modeling Lecture 2: Review of Map Projections and Intro to Spatial Analysis Guofeng Cao http://thestarlab.github.io Department of Geosciences Texas Tech University
More informationDriving Forces of Houston s Burglary Hotspots During Hurricane Rita
Driving Forces of Houston s Burglary Hotspots During Hurricane Rita Marco Helbich Department of Geography University of Heidelberg Heidelberg, Germany & Michael Leitner Department of Geography and Anthropology
More informationSPATIAL-TEMPORAL TECHNIQUES FOR PREDICTION AND COMPRESSION OF SOIL FERTILITY DATA
SPATIAL-TEMPORAL TECHNIQUES FOR PREDICTION AND COMPRESSION OF SOIL FERTILITY DATA D. Pokrajac Center for Information Science and Technology Temple University Philadelphia, Pennsylvania A. Lazarevic Computer
More informationCS570 Data Mining. Anomaly Detection. Li Xiong. Slide credits: Tan, Steinbach, Kumar Jiawei Han and Micheline Kamber.
CS570 Data Mining Anomaly Detection Li Xiong Slide credits: Tan, Steinbach, Kumar Jiawei Han and Micheline Kamber April 3, 2011 1 Anomaly Detection Anomaly is a pattern in the data that does not conform
More informationUSE OF FUZZY LOGIC TO INVESTIGATE WEATHER PARAMETER IMPACT ON ELECTRICAL LOAD BASED ON SHORT TERM FORECASTING
Nigerian Journal of Technology (NIJOTECH) Vol. 35, No. 3, July 2016, pp. 562 567 Copyright Faculty of Engineering, University of Nigeria, Nsukka, Print ISSN: 0331-8443, Electronic ISSN: 2467-8821 www.nijotech.com
More informationAppendix C. AMEC Evaluation of Zuni PPIW. Appendix C. Page C-1 of 34
AMEC s Independent Estimate of PPIW Crop Water Use Using the ASCE Standardized Reference Evapotranspiration via Gridded Meteorological Data, and Estimation of Crop Coefficients, and Net Annual Diversions
More informationFrom statistics to data science. BAE 815 (Fall 2017) Dr. Zifei Liu
From statistics to data science BAE 815 (Fall 2017) Dr. Zifei Liu Zifeiliu@ksu.edu Why? How? What? How much? How many? Individual facts (quantities, characters, or symbols) The Data-Information-Knowledge-Wisdom
More informationBelieve it Today or Tomorrow? Detecting Untrustworthy Information from Dynamic Multi-Source Data
SDM 15 Vancouver, CAN Believe it Today or Tomorrow? Detecting Untrustworthy Information from Dynamic Multi-Source Data Houping Xiao 1, Yaliang Li 1, Jing Gao 1, Fei Wang 2, Liang Ge 3, Wei Fan 4, Long
More informationIntroducing GIS analysis
1 Introducing GIS analysis GIS analysis lets you see patterns and relationships in your geographic data. The results of your analysis will give you insight into a place, help you focus your actions, or
More informationstatistical methods for tailoring seasonal climate forecasts Andrew W. Robertson, IRI
statistical methods for tailoring seasonal climate forecasts Andrew W. Robertson, IRI tailored seasonal forecasts why do we make probabilistic forecasts? to reduce our uncertainty about the (unknown) future
More informationVersion 1.1 GIS Syllabus
GEOGRAPHIC INFORMATION SYSTEMS CERTIFICATION Version 1.1 GIS Syllabus Endorsed 1 Version 1 January 2007 GIS Certification Programme 1. Target The GIS certification is aimed at: Those who wish to demonstrate
More informationThe Quadratic Entropy Approach to Implement the Id3 Decision Tree Algorithm
Journal of Computer Science and Information Technology December 2018, Vol. 6, No. 2, pp. 23-29 ISSN: 2334-2366 (Print), 2334-2374 (Online) Copyright The Author(s). All Rights Reserved. Published by American
More informationIdentifying Road Accidents Severity Problems Using Data Mining Approaches
Identifying Road Accidents Severity Problems Using Data Mining Approaches Mr. P. Sampath*, Dr. Y. Jeevan Nagendra Kumar** * Department of Information Technology, Gokaraju Rangaraju Institute of Engineering
More informationDecision Tree Analysis for Classification Problems. Entscheidungsunterstützungssysteme SS 18
Decision Tree Analysis for Classification Problems Entscheidungsunterstützungssysteme SS 18 Supervised segmentation An intuitive way of thinking about extracting patterns from data in a supervised manner
More informationa system for input, storage, manipulation, and output of geographic information. GIS combines software with hardware,
Introduction to GIS Dr. Pranjit Kr. Sarma Assistant Professor Department of Geography Mangaldi College Mobile: +91 94357 04398 What is a GIS a system for input, storage, manipulation, and output of geographic
More informationIntroduction to GIS - 2
Introduction to GIS - 2 Outline Using GIS Representation of spatial objects in GIS Prof. D. Nagesh Kumar Department of Civil Engineering Indian Institute of Science Bangalore 560 012, India http://www.civil.iisc.ernet.in/~nagesh
More informationInvestigation of Road Traffic Fatal Accidents Using Data Mining Techniques
Investigation of Road Traffic Fatal Accidents Using Data Mining Techniques M. Nagaraju *1 B. Bhavani *2 K.Rohith *3 *1,2,3 Assistant Professor Department of Computer Science & Engineering *1,2,3 Nalla
More informationGEOGRAPHIC INFORMATION SYSTEMS (GIS)
AN INTRODUCTION TO GEOGRAPHIC INFORMATION SYSTEMS (GIS) Joana Magos Brehm 1, Ali Shehadeh 2 1 Botanical Garden, National Museum of Natural History, University of Lisbon (Portugal) 2 Genetic Resources Unit,
More informationNR402 GIS Applications in Natural Resources
NR402 GIS Applications in Natural Resources Lesson 1 Introduction to GIS Eva Strand, University of Idaho Map of the Pacific Northwest from http://www.or.blm.gov/gis/ Welcome to NR402 GIS Applications in
More informationGeometric Algorithms in GIS
Geometric Algorithms in GIS GIS Software Dr. M. Gavrilova GIS System What is a GIS system? A system containing spatially referenced data that can be analyzed and converted to new information for a specific
More informationENV208/ENV508 Applied GIS. Week 1: What is GIS?
ENV208/ENV508 Applied GIS Week 1: What is GIS? 1 WHAT IS GIS? A GIS integrates hardware, software, and data for capturing, managing, analyzing, and displaying all forms of geographically referenced information.
More informationWEATHER PREDICTION FOR INDIAN LOCATION USING MACHINE LEARNING
Volume 118 No. 22 2018, 1945-1949 ISSN: 1314-3395 (on-line version) url: http://acadpubl.eu/hub ijpam.eu WEATHER PREDICTION FOR INDIAN LOCATION USING MACHINE LEARNING 1 Jitcha Shivang, 2 S.S Sridhar 1
More informationBrief note on GIS and Mapping Training Course. Venue: Arnold Bhavan, 8-11, Ravindra Nagar, Habsiguda, Hyderabad Duration: Aug 2018
Brief note on GIS and Mapping Training Course Venue: Arnold Bhavan, 8-11, Ravindra Nagar, Habsiguda, Hyderabad-500013 Duration: 17-18 Aug 2018 Water and Livelihoods Foundation organized a training program
More informationMachine Learning & Data Mining
Group M L D Machine Learning M & Data Mining Chapter 7 Decision Trees Xin-Shun Xu @ SDU School of Computer Science and Technology, Shandong University Top 10 Algorithm in DM #1: C4.5 #2: K-Means #3: SVM
More informationIntroduction to GIS I
Introduction to GIS Introduction How to answer geographical questions such as follows: What is the population of a particular city? What are the characteristics of the soils in a particular land parcel?
More informationGIS Workshop Data Collection Techniques
GIS Workshop Data Collection Techniques NOFNEC Conference 2016 Presented by: Matawa First Nations Management Jennifer Duncan and Charlene Wagenaar, Geomatics Technicians, Four Rivers Department QA #: FRG
More informationEnvironmental Chemistry through Intelligent Atmospheric Data Analysis (EnChIlADA): A Platform for Mining ATOFMS and Other Atmospheric Data
Environmental Chemistry through Intelligent Atmospheric Data Analysis (EnChIlADA): A Platform for Mining ATOFMS and Other Atmospheric Data Katie Barton, John Choiniere, Melanie Yuen, and Deborah Gross
More informationMonsuru Adepeju 1 and Andy Evans 2. School of Geography, University of Leeds, LS21 1HB 1
Investigating the impacts of training data set length (T) and the aggregation unit size (M) on the accuracy of the self-exciting point process (SEPP) hotspot method Monsuru Adepeju 1 and Andy Evans 2 1,
More informationPrediction of Monthly Rainfall of Nainital Region using Artificial Neural Network (ANN) and Support Vector Machine (SVM)
Vol- Issue-3 25 Prediction of ly of Nainital Region using Artificial Neural Network (ANN) and Support Vector Machine (SVM) Deepa Bisht*, Mahesh C Joshi*, Ashish Mehta** *Department of Mathematics **Department
More informationGEOGRAPHY 350/550 Final Exam Fall 2005 NAME:
1) A GIS data model using an array of cells to store spatial data is termed: a) Topology b) Vector c) Object d) Raster 2) Metadata a) Usually includes map projection, scale, data types and origin, resolution
More informationOutline. ArcGIS? ArcMap? I Understanding ArcMap. ArcMap GIS & GWR GEOGRAPHICALLY WEIGHTED REGRESSION. (Brief) Overview of ArcMap
GEOGRAPHICALLY WEIGHTED REGRESSION Outline GWR 3.0 Software for GWR (Brief) Overview of ArcMap Displaying GWR results in ArcMap stewart.fotheringham@nuim.ie http://ncg.nuim.ie ncg.nuim.ie/gwr/ ArcGIS?
More informationMultivariate Analysis of Crime Data using Spatial Outlier Detection Algorithm
J. Stat. Appl. Pro. 5, No. 3, 433-438 (2016) 433 Journal of Statistics Applications & Probability An International Journal http://dx.doi.org/10.18576/jsap/050307 Multivariate Analysis of Crime Data using
More information10/13/2011. Introduction. Introduction to GPS and GIS Workshop. Schedule. What We Will Cover
Introduction Introduction to GPS and GIS Workshop Institute for Social and Environmental Research Nepal October 13 October 15, 2011 Alex Zvoleff azvoleff@mail.sdsu.edu http://rohan.sdsu.edu/~zvoleff Instructor:
More informationArcGIS Pro: Analysis and Geoprocessing. Nicholas M. Giner Esri Christopher Gabris Blue Raster
ArcGIS Pro: Analysis and Geoprocessing Nicholas M. Giner Esri Christopher Gabris Blue Raster Agenda What is Analysis and Geoprocessing? Analysis in ArcGIS Pro - 2D (Spatial xy) - 3D (Elevation - z) - 4D
More informationCS6375: Machine Learning Gautam Kunapuli. Decision Trees
Gautam Kunapuli Example: Restaurant Recommendation Example: Develop a model to recommend restaurants to users depending on their past dining experiences. Here, the features are cost (x ) and the user s
More informationKnowledge Discovery in Spatial Databases
Proceedings of SPIT-IEEE Colloquium and International Conference, Mumbai, India Vol. 4, 101 Knowledge Discovery in Spatial Databases Jyoti Jadhav Shah & Anchor Kutchhi College of Engineering, Chembur,
More information