Seismic source modeling by clustering earthquakes and predicting earthquake magnitudes

Similar documents
Analysis Of Earthquake Records of Istanbul Earthquake Rapid Response System Stations Related to the Determination of Site Fundamental Frequency

P33 Correlation between the Values of b and DC for the Different Regions in the Western Anatolia

C05 Evaluation of Earthquake Hazard Parameters for the Different Regions in the Western Anatolia for Whole Time Periods

School of Computer Science and Engineeering,VIT University,Vellore. Emai: Received on: Accepted on:

EARTHQUAKE PREDICTION BY USING GIS

Uniform Hazard Spectrum(UHS) for performance based seismic design

CHAPTER 3 METHODOLOGY

EARTHQUAKE HAZARD ASSESSMENT IN KAZAKHSTAN

Discrimination of blasts in mine seismology

PSHA results for the BSHAP region

Contribution of HPC to the mitigation of natural risks. B. Feignier. CEA-DAM Ile de France Département Analyse, Surveillance, Environnement

AN OVERVIEW AND GUIDELINES FOR PROBABILISTIC SEISMIC HAZARD MAPPING

Lecture 2. Judging the Performance of Classifiers. Nitin R. Patel

Kinematic Waveform Inversion Study of Regional Earthquakes in Southwest Iberia

Probabilistic Seismic Hazard Analysis in Thailand and Adjacent Areas by Using Regional Seismic Source Zones

The Length to Which an Earthquake will go to Rupture. University of Nevada, Reno 89557

Unique Site Conditions and Response Analysis Challenges in the Central and Eastern U.S.

Style-aware Mid-level Representation for Discovering Visual Connections in Space and Time

Hierarchical models for the rainfall forecast DATA MINING APPROACH

DETERMINATION OF EARTHQUAKE PARAMETERS USING SINGLE STATION BROADBAND DATA IN SRI LANKA

Objective: You will learn to interpret a seismogram and, using differences in seismic waves, locate the epicenter of an earthquake.

COMPREHENSIVE GIS-BASED SOLUTION FOR ROAD BLOCKAGE DUE TO SEISMIC BUILDING COLLAPSE IN TEHRAN

Cell-based Model For GIS Generalization

Unraveling the Mysteries of Seismicity in Oklahoma. A Story Map

Identifying fault activation during hydraulic stimulation in the Barnett shale: source mechanisms, b

Probabilistic Seismic Hazard Analysis of Nepal considering Uniform Density Model

Intelligence and statistics for rapid and robust earthquake detection, association and location

National Seismological Centre: An Overview, Prospects and Challenges

SPATIO-TEMPORAL CLUSTER ANALYSIS OF THE EARTHQUAKE EPICENTERS IN TURKEY AND ITS SURROUNDING AREA BETWEEN 1900 AND 2014

Relevance Vector Machines for Earthquake Response Spectra

Optimizing Model Development and Validation Procedures of Partial Least Squares for Spectral Based Prediction of Soil Properties

5. Probabilistic Seismic Hazard Analysis

EARTHQUAKE CLUSTERS, SMALL EARTHQUAKES

Gutenberg-Richter recurrence law to seismicity analysis of Bangladesh

Anomaly Detection for the CERN Large Hadron Collider injection magnets

NEW LOCAL MAGNITUDE CALIBRATION FOR VRANCEA (ROMANIA) INTERMEDIATE-DEPTH EARTHQUAKES

Supplemental Information for. Geospatial analysis of Oklahoma earthquakes ( ): Quantifying the limits of regional-scale earthquake mitigation

A. Akinci 1, M. Murru 1, R. Console 2, G. Falcone 1, S. Pucci 1 1. Istituto Nazionale di Geofisica e Vulcanologia, Rome, Italy 2

UPDATED PROBABILISTIC SEISMIC HAZARD MAPS FOR TURKEY

Vertical to Horizontal (V/H) Ratios for Large Megathrust Subduction Zone Earthquakes

Section 19.1: Forces Within Earth Section 19.2: Seismic Waves and Earth s Interior Section 19.3: Measuring and Locating.

Lab 9: Satellite Geodesy (35 points)

Inquiry: The University of Arkansas Undergraduate Research Journal. Volume 5 Article 15

P32 Temporal and Spatial Variations of b-value in the Western Anatolia

EARTHQUAKE FORECASTING IN BANGLADESH AND ITS SURROUNDING REGIONS

Occurrence of negative epsilon in seismic hazard analysis deaggregation, and its impact on target spectra computation

L. Danciu, D. Giardini, J. Wößner Swiss Seismological Service ETH-Zurich Switzerland

Spatial Extension of the Reality Mining Dataset

Issues in Dependency Modeling in Multi- Unit Seismic PRA

Earthquake prediction through Kannan-Mathematical-Model Analysis and Dobrovolsky-based clustering Technique

Module 7 SEISMIC HAZARD ANALYSIS (Lectures 33 to 36)

WMO Aeronautical Meteorology Scientific Conference 2017

Section Forces Within Earth. 8 th Grade Earth & Space Science - Class Notes

Risk Analysis for Assessment of Vegetation Impact on Outages in Electric Power Systems. T. DOKIC, P.-C. CHEN, M. KEZUNOVIC Texas A&M University USA

2008 Monitoring Research Review: Ground-Based Nuclear Explosion Monitoring Technologies

Quantifying uncertainty of geological 3D layer models, constructed with a-priori

Appendix O: Gridded Seismicity Sources

Ivan Wong Seismic Hazards Group AECOM Oakland, CA. IEAGHG Monitoring Network Meeting. 10 June 2015

Estimation of Peak Ground Acceleration for Delhi Region using Finsim, a Finite Fault Simulation Technique

Estimation of Regional Seismic Hazard in the Korean Peninsula Using Historical Earthquake Data between A.D. 2 and 1995

Overview of Seismic PHSA Approaches with Emphasis on the Management of Uncertainties

Making Our Cities Safer: A Study In Neighbhorhood Crime Patterns

Landslide Susceptibility Mapping Using Logistic Regression in Garut District, West Java, Indonesia

BAD VIBRATIONS (Earthquakes/Tsunamis)

Moment tensor inversion of near source seismograms

10.1 A summary of the Virtual Seismologist (VS) method for seismic early warning

Using Image Moment Invariants to Distinguish Classes of Geographical Shapes

Usually, only a couple of centuries of earthquake data is available, much shorter than the complete seismic cycle for most plate motions.

Seismic Source Characterization in Siting New Nuclear Power Plants in the Central and Eastern United States

Text Mining. Dr. Yanjun Li. Associate Professor. Department of Computer and Information Sciences Fordham University

Earthquakes. Earthquake Magnitudes 10/1/2013. Environmental Geology Chapter 8 Earthquakes and Related Phenomena

Data Mining. 3.6 Regression Analysis. Fall Instructor: Dr. Masoud Yaghini. Numeric Prediction

Earthquake Hazards in Henderson

Hierarchical Bayesian Modeling of Multisite Daily Rainfall Occurrence

THE RESPONSE SPECTRUM

By Chris Mikes and David Fleck Digital Terrain Analysis GEOG 593. Overview

Knowledge of in-slab earthquakes needed to improve seismic hazard estimates for southwestern British Columbia

South Carolina Seismic Network Bulletin

TSUNAMI HAZARD ASSESSMENT IN NORTHERN EGYPT USING NUMERICAL SIMULATION

Data Mining and Machine Learning

EARTHQUAKE SOURCE PARAMETERS FOR SUBDUCTION ZONE EVENTS CAUSING TSUNAMIS IN AND AROUND THE PHILIPPINES

Seismic Risk And Engineering Decisions (Developments In Geotechnical Engineering) READ ONLINE

Observations on the 2015 February ML 5.0 and July ML 5.4 Central Queensland Earthquake Sequences

Aspects of risk assessment in power-law distributed natural hazards

Chapter 2 Multivariate Statistical Analysis for Seismotectonic Provinces Using Earthquake, Active Fault, and Crustal Structure Datasets

LANDSLIDE SUSCEPTIBILITY MAPPING USING INFO VALUE METHOD BASED ON GIS

Preliminary Earthquake Risk Management Strategy Plan of Eskisehir, Turkey by using GIS

CHAPTER 3 RESEARCH METHODOLOGY

Global GMPEs. Caribbean Regional Programme Workshop Port of Spain, Trinidad & Tobago, May 2 nd 2011

Monthly Prediction of the Vegetation Condition: A case for Ethiopia

GEO-VIII November Geohazard Supersites and Natural Laboratories Progress Report. Document 9

Probabilistic approach to earthquake prediction

CLUe Training An Introduction to Machine Learning in R with an example from handwritten digit recognition

Predictive Analytics on Accident Data Using Rule Based and Discriminative Classifiers

WESTERN STATES SEISMIC POLICY COUNCIL POLICY RECOMMENDATION Definitions of Recency of Surface Faulting for the Basin and Range Province

Core Curriculum Supplement

CPT-BASED SIMPLIFIED LIQUEFACTION ASSESSMENT BY USING FUZZY-NEURAL NETWORK

Historical Maximum Seismic Intensity Maps in Japan from 1586 to 2004: Construction of Database and Application. Masatoshi MIYAZAWA* and Jim MORI

Landslide Mapping and Hazard Analysis for a Natural Gas Pipeline Project

Increase of coal burning efficiency via automatic mathematical modeling. Patrick Bangert algorithmica technologies GmbH 1 Germany

Transcription:

Seismic source modeling by clustering earthquakes and predicting earthquake magnitudes Mahdi Hashemi, Hassan A. Karimi Geoinformatics Laboratory, School of Information Sciences, University of Pittsburgh, 135 North Bellefield Avenue, Pittsburgh, Pennsylvania 15260, U.S. m.hashemi1987@gmail.com, hkarimi@pitt.edu Abstract. Seismic sources are currently generated manually by experts, a process which is not efficient as the size of historical earthquake databases is growing. However, large historical earthquake databases provide an opportunity to generate seismic sources through data mining techniques. In this paper, we propose hierarchical clustering of historical earthquakes for generating seismic sources automatically. To evaluate the effectiveness of clustering in producing homogenous seismic sources, we compare the accuracy of earthquake magnitude prediction models before and after clustering. Three prediction models are experimented: decision tree, SVM, and knn. The results show that: (1) the clustering approach leads to improved accuracy of prediction models; (2) the most accurate prediction model and the most homogenous seismic sources are achieved when earthquakes are clustered based on their non-spatial attributes; and (3) among the three prediction models experimented in this work, decision tree is the most accurate one. Keywords: Clustering; Prediction; Seismic source; Earthquake, Big data. 1 Introduction The study of earthquake ground motions and associated hazards and risks play an important role in sustainable development especially in earthquake-prone areas such as southwestern United States [1; 2]. Reliable evaluation of seismic hazards and risks is a foundation for all earthquake mitigation plans, upon which decision makers can prepare for earthquakes in an optimal way. The first step in any seismic hazard analysis is earthquake source modeling [3; 4]. A single earthquake source is supposed to be uniform in terms of earthquake potential, i.e., the chance of an earthquake of a given magnitude occurring is the same throughout the source. Sources may be linear or areal [4] and are usually used to generate hazard maps and estimate the probability of earthquakes of different magnitudes [5]. Large collections of historical earthquakes have made it possible to construct these sources more efficiently. Seismologists usually determine the boundary of seismic sources manually based on historical earthquakes and tectonic features [6; 7; 8] with no standard or automatic method in place. However, as the size of historical earthquake databases grows, the manual

delineation of source boundaries becomes more cumbersome and less accurate. This calls for development of approaches to automate the same process. Anderson and Nanjo [2] clustered earthquakes based on their distance in space and time and proposed an optimal distance and time interval, obtained experimentally, for clustering earthquakes. Zmazek et al. [9] used a decision tree to predict the radon concentration in soil based on environmental variables. They found that the accuracy of their prediction model changes during seismically active periods comparing with seismically inactive periods. They proposed to predict the time of earthquakes based on this observation. Hashemi and Alesheikh [10] used spatial data mining techniques and indices to reveal the characteristics of earthquakes. They clustered earthquakes around a fault in one class and showed that the earthquake magnitudes in each class are neither spatially correlated nor have any spatial trend, though the earthquakes themselves are strongly clustered at multiple distances. They suggested, as future research, developing prediction models of earthquake characteristics. The work in this paper is focused on developing a methodology for generating areal seismic sources based on historical earthquakes. Different from previous approaches, the proposed methodology benefits from hierarchical clustering technique [11; 12; 13] and is for the purpose of automating the process. Faults, tectonic features and linear sources are not considered in this work. Three clustering approaches are explored: a) hierarchical clustering only based on non-spatial attributes, b) hierarchical clustering only based on location, and c) hierarchical clustering based on all attributes. The purpose of clustering is to categorize similar events together. When events are earthquakes, this process coincides with the purpose of seismic source modeling. Thus, assuming similar earthquakes are clustered correctly, one should be able to develop more accurate prediction models in each cluster than without clustering. The proposed prediction model in this work aims to predict the magnitude of an earthquake based on its other characteristics. A different prediction model is required for each cluster. Assuming the first clustering approach (a above) results in n clusters, there should be n prediction models, one for each cluster. Consequently if the second and third clustering approaches (b and c above) result in m and k clusters, respectively, there should be m prediction models for the second one and k prediction models for the third one. Decision tree, SVM and knn [11; 12; 13] are three different prediction models experimented in this work, resulting in a total of 3 n m k different prediction models. These prediction models are evaluated using 10-fold cross validation. The accuracy of a prediction model not only reveals the strength and suitability of the applied prediction model (decision tree, SVM or knn), but also demonstrates the effectiveness of clustering in producing homogenous seismic sources. Thus, by comparing and analyzing the evaluation results, suggestions are made at the end of this article regarding appropriate clustering approaches and prediction models for earthquakes. Fig. 1 shows the process of clustering earthquakes and predicting earthquake magnitudes used in this work.

Data Earthquakes with a magnitude of greater than 4 in USA since Apr 18, 1906 until Jan 01, 2015 Variables: Magnitude, depth, occurrence date, longitude, latitude Preprocessing Adding a new variable called NDays which is a conversion of occurrence date to the number of days passed since 1900 Normalizing variables Hierarchical clustering Based on magnitude, depth, and NDays Based on longitude and latitude Based on magnitude, depth, Ndays, longitude, and latitude Separating earthquakes in different clusters to different datasets Separating earthquakes in different clusters to different datasets Separating earthquakes in different clusters to different datasets Cluster 1... Cluster n Cluster 1... Cluster n Cluster 1... Cluster n Adding a new variable called Gap to the sub-dataset which shows the number of days passed since the previous earthquake Predicting earthquake magnitude based on variables used for clustering Decision tree approach SVM approach knn approach 10-fold cross validation 10-fold cross validation 10-fold cross validation RMSE RMSE RMSE Discussion and comparison Fig. 1. Process of clustering earthquakes and predicting their magnitudes. 2 Data Earthquakes with a magnitude of greater than 4 in the United States between April 18, 1906 and January 1, 2015 were downloaded from the United States Geological Survey (USGS) website [14]. This is the longest time range available in the database at the time of writing this article in January 2015. The dataset contains 5,368 earthquakes. Table 1 shows the description of each attribute in the dataset [15].

Table 1. Available attributes for earthquakes. Variable Longitude Latitude Magnitude Depth Description Decimal degrees longitude. Negative values for western longitudes. Decimal degrees latitude. Negative values for southern latitudes. The magnitude for the event. Depth of the event in kilometers. Although the occurrence dates of earthquakes are available in the original dataset, they are formatted here as the number of days passed since 1900 and called NDays in the dataset. Since all earthquakes in the dataset have occurred after 1900, all values for this variable are positive. An important step before clustering data points or developing prediction models is normalizing variables. To normalize a variable (e.g., Magnitude in the dataset), the values are transformed to a normal distribution with a mean of zero and standard deviation of one. Equation 1 shows this normalization where is the mean and s is the standard deviation of data. / (1) This step is important because if the range of one variable is much larger than the range of other variables, it will dominate the clustering and prediction process. By normalizing all variables to the same scale, their contribution in the clustering and prediction models is homogenized. 3 Clustering Hierarchical clustering technique is chosen for clustering earthquakes because unlike k-means technique it is not sensitive to initial seeds [11]. The distance between two clusters during hierarchical clustering can be calculated using different methods. Average-link method is chosen here because unlike single-link and complete-link methods it is less sensitive to outliers. However, both advantages (not being sensitive to initial seeds and being less sensitive to outliers) come with computational cost [12; 13]. The earthquakes are clustered in 10 classes. If a class contains only one earthquake, that earthquake is eliminated and the clustering process is repeated until each cluster contains more than one earthquake. This iterative elimination process helps filter out outliers. Clustering Based on Non-spatial Variables The earthquakes are clustered based on their magnitude, depth and occurrence date, i.e., earthquakes which have close magnitudes, depths and occurrence dates are more probable to be in the same cluster. At the first iteration, three clusters contained only one earthquake. These three earthquakes were removed and the clustering process

was repeated. In the second iteration, there was one cluster with one earthquake. This earthquake was eliminated. In the third iteration, all clusters contained more than one earthquake. Clustering Based on Spatial Variables The earthquakes are clustered based on their location (longitude and latitude). There is no need to normalize the variables (columns) for this clustering because the distances in longitude and latitude are compatible with and compensate each other. The resultant clusters contained more than one earthquake in the first iteration. Clustering Based on All Variables The earthquakes are clustered based on their magnitude, depth, occurrence date, longitude, and latitude. At the first iteration, three clusters contained only one earthquake. These three earthquakes were removed and the clustering process was repeated. In the second iteration, all clusters contained more than one earthquake. 4 Earthquakes Clusters and Magnitude Prediction Since the prediction model is developed for earthquakes in each cluster independently and separately, earthquakes in each cluster are moved to a new dataset. Thus, the number of sub-datasets is equal to the number of clusters, the union of sub-datasets is the original dataset and the intersection of sub-datasets is empty. One of the variables required for predicting the magnitude of earthquakes is the number of days passed since the last earthquake. We call this variable Gap and add it to each sub-dataset separately. To calculate Gap, first the sub-dataset is ordered in an ascending order based on NDays. NDays is representative of the earthquake occurrence date as the number of days passed since 1900. Gap for an earthquake is equal to its NDays subtracted by the NDays of its immediate predecessor in the subdataset. Since Gap cannot be calculated for the first earthquake, it is removed from the sub-dataset. A prediction model is developed for each sub-dataset to predict the magnitude based on other predictors. The prediction model for the sub-dataset is evaluated using 10- fold cross validation and root mean square error (RMSE) is calculated to evaluate the accuracy of the prediction model. As mentioned before, there are 10 sub-datasets for each dataset, each includes earthquakes of a specific cluster. Consequently, there will be 10 different prediction models with 10 different RMSEs. However, to achieve a single RMSE for the entire dataset (including 10 sub-datasets), the weighted average of these 10 RMSEs is calculated. The weight is the number of earthquakes in the subdataset. Three different prediction models are experimented: decision tree, SVM, and knn.

5 Results Figs. 2, 3, and 4 show the size of each cluster (logarithmic scale) in three different clusterings explained in Section 3. The cluster sizes are closer to each other in Fig. 3 compared to the other two cases in Figs. 2 and 4. This observation shows that earthquakes are distributed in a few clusters almost uniformly in terms of their locations, though, in terms of their magnitude, depth and occurrence date, most earthquakes (80%) are in one class while the rest of them are distributed in nine other classes. 4 3 2 1 0 1 2 3 4 5 6 7 8 9 10 Fig. 2. Size of each cluster (log 10 scale) when clustering based on non-spatial variables. 4 3 2 1 0 1 2 3 4 5 6 7 8 9 10 Fig. 3. Size of each cluster (log 10 scale) when clustering based on location.

4 3 2 1 0 1 2 3 4 5 6 7 8 9 10 Fig. 4. Size of each cluster (log 10 scale) when clustering based on all variables. Figs. 5, 6, and 7 show the spatial distribution of earthquakes colored based on their clusters. In Fig. 5, earthquakes are colored based on non-spatial attributes (magnitude, depth, and occurrence date) clusters. In Fig. 6, earthquakes are colored based on location clusters. In Fig. 7, earthquakes are colored based on all variables clusters. Lines in these figures are faults. When earthquakes are clustered based on their nonspatial attributes, the geographical distribution of clusters seems random and does not follow the location of faults. In other words, the earthquakes of one cluster may be located in different parts of the region. On the other hand, when earthquakes are clustered only based on their locations, clusters follow the faults. This is compatible with the concept that earthquakes are stacked around faults [10]. Finally, when both spatial and non-spatial attributes of earthquakes are taken into account for clustering, additional factors affect the geographical distribution of clusters. In areas with low seismicity (lower number of earthquakes), most earthquakes belong to one cluster and there are rarely earthquakes of other clusters. However, in areas with high seismicity, such as southwestern U.S., earthquakes of different clusters are stacked together. This observation shows that in areas with low seismicity, geographical location of earthquakes dominates the clustering but in seismically active areas, with dense historical earthquakes, other non-spatial attributes dominate the clustering.

Fig. 5. Clustering earthquakes based on non-spatial attributes. Fig. 6. Clustering earthquakes based on location.

Fig. 7. Clustering earthquakes based on all attributes. The results for different prediction models and clustering criteria are shown in Table 2. The RMSE (last column in the table) indicates the accuracy of the prediction model. This RMSE is obtained throughh 10-fold cross validation. Following are the steps to calculate the RMSE of 0.329 in the first row of Table 2: Obtain the earthquakee clusters (10 in total) based on magnitude, depth, and occurrence date. Develop one prediction model for each of these 10 clusters. Evaluate each prediction model using 10-fold cross validation and calculate a RMSE. Calculate the weighted average of ten RMSEs which is 0.329. The accuracy measure (RMSE) is affected by variables considered for clustering and the prediction model. According to Table 2, knn is obviously not a good prediction model because its RMSE is much larger than the RMSEs for the other two prediction models. Decision tree has a slightly smaller RMSE than SVM. Besides, decision treee is a much faster prediction model than SVM [11; 12; 13]. When earthquakes are clustered based on their non-spatial attributes (magnitude, depth, and occurrence date) and only depth and Gap are used to predict the magnitude, the least RMSE (highest accuracy) is achieved. When earthquakes are clustered based on their location and only their location is used to predict the magnitude, the worst accuracy is observed. When both non-spatial attributes and location of earthquakes are considered for clustering and depth, Gap and location of earthquakes are used together to predict their magnitude, the observed RMSE is close to the average of the two previous cases. With these results, it can be concluded that taking the locations of earthquakes into account has an adverse effect on accuracy of the prediction model.

Table 2. Results of different prediction models after clustering. Clustering criteria Magnitude, depth, and occurrence date Longitude and latitude Magnitude, depth, occurrence date, longitude, and latitude Variables used in prediction of magnitude Depth and Gap Longitude and latitude Depth, Gap, longitude and latitude Prediction RMSE model Decision tree 0.329 SVM 0.339 knn (k=10) 6.905 Decision tree 0.541 SVM 0.564 knn (k=10) 8.262 Decision tree 0.4386 SVM 0.460 knn (k=10) 11.068 Table 3 shows the RMSE of magnitude prediction models without clustering earthquakes. In other words, all earthquakes are considered as one cluster. Comparing the accuracy (RMSE) of prediction models in Tables 2 and 3 shows the effect of clustering on the accuracy of prediction models. According to these two tables, clustering earthquakes decreases the RMSE (improves the accuracy of the magnitude prediction model) by 30% on average over all cases. This observation confirms that clustering earthquakes has been partly successful in generating homogeneous seismic sources. Clustering earthquakes based on their non-spatial attributes (magnitude, depth, and occurrence date) results in the least RMSEs over all different prediction models compared to clustering earthquakes based on spatial or all criteria. In other words, clustering earthquakes based on non-spatial attributes produces the most homogenous hazard zones. Table 3. Results of different prediction models without clustering. Variables used in prediction of magnitude Depth and Gap Longitude and latitude Depth, Gap, longitude and latitude Prediction RMSE model Decision tree 0.502 SVM 0.527 knn (k=10) 10.143 Decision tree 0.545 SVM 0.571 knn (k=10) 13.884 Decision tree 0.505 SVM 0.526 knn (k=10) 13.765 6 Conclusions and Future Directions The most accurate earthquake magnitude prediction model is obtained when the earthquakes are clustered based on their depth, occurrence date and magnitude and the predictors in the prediction model are depth and Gap (number of days passed since the last earthquake in a specific cluster). Adding location of earthquakes to the

clustering criteria and predictors weakens the prediction model. Among the three prediction models experimented in this work to predict the magnitude of earthquakes, decision tree was 95% more accurate than knn and 4% more accurate than SVM in terms of RMSE. Clustering earthquakes reduced all RMSEs by 30% on average which shows clustering earthquakes, as proposed in this work, is a potential approach in producing homogenous seismic sources which can later be used for producing hazard maps. It is also shown that clustering earthquakes based on their non-spatial attributes (magnitude, depth, and occurrence date) produces the most homogenous seismic sources compared to other clustering criteria. Clustering earthquakes based on their non-spatial attributes (magnitude, depth, and occurrence date) resulted in one large cluster and many small clusters. Clustering inside the largest cluster was considered, discarding the other small clusters. However, the results are not shown in this article because it resulted in one very large cluster and other very small clusters. This observation implies that the actual clusters are circularly nested inside each other and cannot be separated using regular k-means or hierarchical clustering approaches. However, this hypothesis requires further investigation and is a future research direction. References 1. Scholz, C. H. (2010). Large earthquake triggering, clustering, and the synchronization of faults. Bulletin of the Seismological Society of America, 100 (3), 901-909. 2. Anderson, J. G., & Nanjo, K. (2013). Distribution of Earthquake Cluster Sizes in the Western United States and in Japan. Bulletin of the Seismological Society of America, 103 (1), 412-423. 3. Cornell, C. (1968). Engineering seismic risk analysis. Bulletin of Seismological Society of America, 58, 1583 1606. 4. Reiter, L. (1990). Earthquake hazard analysis, Issues and insights. New York: Columbia University Press. 5. Anagnos, T., & Kiremidjian, A. S. (1988). A review of earthquake occurrence models for seismic hazard analysis. Probabilistic Engineering Mechanics, 3 (1), 3-11. 6. Hashemi, M., Alesheikh, A. A., & Zolfaghari, M. R. (2013). A spatio-temporal model for probabilistic seismic hazard zonation of Tehran. Computers & Geosciences, 58, 8-18. 7. Erdik, M., Biro, Y. A., Onur, T., Sesetyan, K., & Birgoren, G. (1999). Assessment of earthquake hazard in Turkey and neighboring regions. Annali Di Geofisica, 42 (6), 1125-1138. 8. Erdik, M., Demircioglu, M., Sesetyan, K., Durukal, E., & Siyahi, B. (2004). Earthquake hazard in Marmara Region, Turkey. Soil Dynamics and Earthquake Engineering, 24, 605 631. 9. Zmazek, B., Todorovski, L., Džeroski, S., Vaupotič, J., & Kobal, I. (2003). Application of decision trees to the analysis of soil radon data for earthquake prediction. Applied Radiation and Isotopes, 58 (6), 697-706. 10. Hashemi, M., & Alesheikh, A. (2011). Spatio-temporal analysis of Tehran s historical earthquakes trends. In Proceedings of Advancing Geoinformation Science for a Changing World (pp. 3-20). Utrecht, Netherlands: Springer. 11. Ledolter, J. (2013). Data mining and business analytics with R. Wiley. 12. Conway, D., & White, J. (2012). Machine learning for hackers. O'Reilly Media.

13. Liu, B. (2007). Web data mining: exploring hyperlinks, contents, and usage data. Springer. 14. United States Geological Survey (USGS). (2015). Retrieved from http://earthquake.usgs.gov/earthquakes/search/ 15. United States Geological Survey (USGS). (2015). Retrieved from http://earthquake.usgs.gov/earthquakes/feed/v1.0/glossary.php