Internet Engineering Jacek Mazurkiewicz, PhD

Size: px
Start display at page:

Download "Internet Engineering Jacek Mazurkiewicz, PhD"

Transcription

1 Internet Engineering Jacek Mazurkiewicz, PhD Softcomputing Part 11: SoftComputing Used for Big Data Problems

2 Agenda Climate Changes Prediction System Based on Weather Big Data Visualisation Natural Language Processing Methods Used for Automatic Prediction Mechanism of Related Phenomenon

3 Climate Changes Prediction System Based on Weather Big Data Visualisation Antoni Buszta Dolby Poland Sp. z o.o. Jacek Mazurkiewicz Department of Computer Engineering Wrocław University of Technology, POLAND

4 Goal & Assumptions Goal: - new approach to weather forecasting, - prediction process consisted of processing big data, - turning processed data to visualization, - visualization has been used for enhancing forecasting methods using artificial neural networks. Assumptions: data visualization gives additional interpretation possibilities, it is possible to enhance weather forecasting by data visualization, neural networks can be used for visual weather data analysis, neural networks can be used for climate changes prediction.

5 Classic Weather Data Analysis classic weather forecasting 650BC Babylonians started to predict weather from cloud patterns even early civilizations used reoccurring events to monitor changes in weather and seasons five main weather forecasting methods Persistence Trends Climatology Analog Numerical Weather Prediction

6 Five Main Weather Forecasting Methods Persistence - the most simple and primitive method, - conditions will not change during whole time forecast, - well performing for regions where weather does not change - sunny and dry regions of California in the USA Trends - basing on speed and directions of weather changes, - simple trend model can be calculated, - atmospheric fronts, location of clouds, precipitation, - high and low pressure centers are main factors, - accurate for forecast weather only few hours ahead, - detecting storms or upcoming weather improvement

7 Five Main Weather Forecasting Methods Climatology - bases on statistics and uses long term data accumulated, - take into account wider spectrum of changes occurring, - can provide more accurate local predictions, - main disadvantage is ignoring extremes, - whole model output is averaged, - does not recalculate on radical climate changes - it can provide year by year incorrect output Analog - more complicated than previous ones, - involves collecting data about current weather, - trying to find analogous conditions in the past, - assumes - current situation same as in archival data Numerical Weather Prediction - mathematical models of the atmosphere to calculate possible forecast - short and long term weather predictions

8 Weather Visualization Methods satellite photo visualization of pressure using isobars visualization of temperature using isotherms visualization of wind using isotachs visualization of dew point using isodrosotherms

9 Weather Data Set HadCRUT3 records of the global temperature Intergovernmental Panel on Climate Change (IPCC) data designated by the World Meteorological Organization (WMO) contains monthly temperature values for more that 3000 land stations National Climatic Data Center from Asheville, NC, USA reports collecting global summary of the day data (GSOD) data which creates the Integrated Surface Data (ISD) daily summaries is obtained from USAF Climatology Center beginning from 1929, over 9000 land and sea stations are monitored both of this data sources are available freely for non-commercial use daily reports from GSOD contain multiple collectible data types which can serve as perfect input to big data processing data was provided in Imperial units but in later processing it have been converted to Metric system

10 Neural Network Design (1) Configuration description Input parameters Input neurons Output parameter Output neurons Seven days forecast based temperature only. This method is very similar to analog forecasting Temperature 7 Temperature 1 Seven days forecast based on temperature and number of day in the year. Similar to previous one, but considers also transience of seasons by knowing which part of the year it is Temperature day of the year 8 Temperature 1 One day forecast based on temperature, mean station pressure and wind speed for that day. Analog forecasting aware of multiple weather parameters Temperature mean station pressure for the day and mean wind speed 3 Temperature 1

11 Neural Network Design (2) - input temperature from previous two days, pressure and wind speed from one previous day should provide one day temperature forecast, algorithm needs to keep history of three days of measurements, two days will be used as neural network input, last day will be used for validation or testing if provided answer is correct., neural network build for the purposes of forecasting application was trained using most classic method of training backpropagation, neural network is not being trained until convergence, it is impossible to find optimal solution and create model which will perfectly correspond weather conditions in any situation.

12 Learning Benchmarks (1) days of temperature measurements one look ahead temperature prediction 7 input neurons and 1 output neurons three places in the world were chosen to benchmark neural network performance and effectiveness these cities were Opole, Helsinki and Mexico City choice has been made based on how fast is weather changing within one year in each place what is more in these cities there has been collected enough meteorological data for creating representative dataset and later neural network Opole is a city in southern Poland on the Oder River (Odra), can be characterized by moderate winters and summers Helsinki is the capital and largest city of Finland located in southern Finland, on the shore of the Gulf of Finland, an arm of the Baltic Sea, continental climate and it has lower minimal temperature than previous one Mexico City is capital of Mexico. It has a subtropical highland climate, and due to high altitude of 2,240 m temperature below 3oC and above 30oC

13 Learning Benchmarks (2)

14 Climate Change Prediction (1) - - weather prediction was targeted to whole year 2013, it was the last year for which dataset with complete input to output relations could be acquired, first forecast was created based on 7 day temperature input, additionally neural network had information which part of the year it was, by providing number of the day in year as one of the network inputs, model created for Opole city - data beginning with 1973 and ended with 2012, year 2013 was used for testing forecasting possibilities better results are in months where temperature is closer to yearly average, for Helsinki - neural network better performed in summer, atmospheric conditions were moderate, algorithms were not correctly detect temperature drop in end of February, best results for Mexico City.

15 Climate Change Prediction (2)

16 Climate Change Prediction (3)

17 Climate Change Prediction (4)

18 Conclusions results satisfactory and proved that it is possible to use neural network for weather forecasting big data visualization on the other hand provided easy access to data which in another way would be hard to process analysis showed that generative algorithms should be used in favor of iterative ones due to better processing speed and lower memory consumption executed test and benchmarks, and further analysis have shown that developed application creates valid forecasting reports and it can be successfully used for prediction of multiple weather parameters

19 Natural Language Processing Methods Used for Automatic Prediction Mechanism of Related Phenomenon Krystian Horecki Nokia Networks, Technology Center Wroclaw, POLAND Jacek Mazurkiewicz Department of Computer Engineering Wrocław University of Technology, POLAND

20 Goal & Assumptions Goal: an idea to combine variety of Natural Language Processing techniques with different classification methods as a tool for automatic prediction mechanism of related phenomenon paper proposes the new, promising ways of automatic data and content mining methods for the big data systems presented accuracy results are much better than average classification for sentimental analysis done by the human Assumptions: different types of preprocessing techniques are used and verified, in order to find the best set of them approach allows to recognize the phenomenon related to the text the real input from the big data systems the news website articles are the source of raw text data

21 Motivation millions of texts in web hard to determine meaning and reason of text existence it is important to understand what is happening around you automatically many existing text mining solutions (Angoss, SAS and many more) well known and researched Machine Learning techniques lack of solutions connecting text analysis with related phenomenon no existing solutions covering paper topic web text analysis finding related phenomenon based on text parameters usage of Machine Learning techniques usage of Natural Language Processing techniques

22 Input Data / Text Filtering Levels (1) real articles from web automatically extracted manually categorized sentimental analysis data stored in database after processing more than 1000 articles Level 1 Punctuation and stop words filtering Short words filtering Lemmatisation Level 2 Related words filtering Level 3 Adjectives to nouns transformation

23 Input Data / Text Filtering Levels (2) Level 1 - filtering is mostly focused on short words, stop words and punctuation marks removal, includes also conversion to lowercase - it also uses lemmatisation techniques which were used to reduce number of similar words Level 2 - filtering is based on semantic trees analysis and removal of similar words according to neighborhood in the tree, - technique which uses semantic trees to reduce words number - it was the authors idea to combine such extensive related words merging with text categorization Level 3 - filtering is connected with removal of adjectives and replacing them with corresponding nouns, - method should give accuracy enhancement in case of usage words such polish and Poland, so we could get the same word two times instead of having different words - authors idea was to combine this method with text categorization

24 Level 2 Filtering Algorithm

25 Natural Language Processing dividing text into tokens - called also as tokenization, usage of ready-to-use semantic trees, usage of text dictionaries, frequency distribution analysis, lemmatization each of listed methods should give additional accuracy enhancement that should be examined by testing different filtering levels most of them taken from the NLTK library some required additional custom implementation the usage of the ready-to-use dictionaries and corpuses which contain already collected data for different purposes two corpuses used during the implementation: a stop words dictionary and a wordnet dictionary wordnet corpus was the most important one because it allowed to analyze the relations between examined words having such large lexical database of English it was possible to match words having the same meaning but different form

26 Classification Mechanisms (1) each classifier has unified interface and can be used separately interface should contain: learning method, testing method, classifying method learning method receive a training set and a train classifier object classifier instance can be tested with the test method that receives test data set and returns an average accuracy for all test set elements it is possible to classify one feature vector classifiers were used to test whether our additional techniques improved the classification Neural Network Classifier was implemented as Multilayer Perceptron with custom parameters such as a number of hidden neurons, a number of hidden layers, a number of input and output neurons, a type of the network and types of the activation functions Max Entropy Classifier was used with improved Iterative scaling algorithm without Gaussian prior Naive Bayes Classification Algorithm - due to simplicity and popularity Classifier based on Decision Tree with maximum depth of 100 and the use of a single n-way branch for each feature

27 Classification Mechanisms (2) testing of the classifiers was designed and implemented that each classifier could be tested with different parameters core part of the test mechanism is a definition of test case where user can put data regarding the values of parameters which are later used by the classifiers such approach makes it possible to check how different classifiers behave against changing parameters testing is done by first shuffling dataset and later by splitting it into 2 parts classifier is trained with a training part of the data set and later is tested with the test data set average accuracy of classification is a result for the classifier testing testing is done for each filtering level each classifier is tested many times with shuffled data set which removes chance of wrong results and let user calculate an average accuracy from those many iterations results of each test case, each classifier, each filtering level are stored in database

28 Algorithm for Classifiers Testing

29 Multilayer Perceptron Classifier test cases examined the relation between a number of training epochs for Multilayer Perceptron Classifier and the classification accuracy possibly minimal number of training epochs can make learning time shorter and it means that training is more efficient number of training epochs was set as a range of values between 1 and 20 number of features which were extracted from the data set was set to 100 only 30 most informative features selected using Naive Bayesian Classifier. test for each number of epochs was repeated 15 times in order to get average results test gave very important outcome which is information that any number of training epochs bigger than 3 can give proper classification results that made later tests much shorter conclusion is that neural networks does not have to be trained with big amount of learning epochs when the big amount of data is used for the training.

30 Maxent Classifier test cases examined the relation between number of training epochs for Maxent Classifier and classification accuracy number of iterations was examined in order to get information when a number of iterations is sufficient number of training iterations was set as a range of values:1 and 20 number of features which were extracted from data set: most informative features selected using Naive Bayesian Classifier test for each number of epochs was repeated 15 times in order to get average results for Maxent Classifier stable classification accuracy after the 9th iteration that classification accuracy for the 3rd level of input data filtration was better during early training stages, it can be noticed that results in iterations between 10 and 20 are quite similar for all filtering data set used for testing contained 1039 articles, each article had at least 200 words

31 Results of test for relation between classification accuracy and number of feature words in case of Naive Bayesian Classifier

32 Results of test for relation between classification accuracy and number of most informative feature words in case of Naive Bayesian Classifier

33 Results of test for relation between classification accuracy and number of feature words in case Level 3 of filtering and different classifiers

34 Results of test for relation between classification accuracy and number of most informative feature words in case Level 3 of filtering and different classifiers

35 Summary it is very important how the raw data is prepared before providing it to a classifier - some classifiers are less sensitive to information noises which are included in unfiltered data. Naive Bayesian Classifier provides decent accuracy with the lowest training set size and training effort. promising results using Maxent Classifier since it proved to be not sensitive to noise as in case of Naive Bayesian model, usage of additional features filtering methods and big amount of features can possibly give very good results it is clear that usage of Naive Bayesian Classifier with the 3rd input data filtering level and features filtering is probably the best method to be used for text data classification it was proved that it is possible to build a system that can be trained to recognize given phenomenon using Natural Language Processing and machine learning techniques we found additional techniques which helped us to improve classification accuracy using different classification models, we performed proved that additional techniques allowed to enhance accuracy of the classification for each type of the classification model. the most useful and efficient classifier seems to be Naive Bayes, since it combines high training speed, ability to work with small data sets and high classification accuracy. it is promising that methods created by us proved to give accuracy gain, it is important that the accuracy results which were gained during automatic classification of articles are much better than average classification for sentimental analysis done by the human. we conducted our research on one phenomenon, however it is theoretically possible to apply existing methodology and implementation to other phenomena. it would be very good to execute tests against other types of phenomena and check how classifiers and filtering methods behave within such conditions the suggested future usage of Support Vector Machine Classifier could give promising results since this classifier is popular in sentimental analysis and proved to be the best in terms of categorization accuracy further filtering algorithm enhancement is also possible. Filtering of raw text could be extended with additional procedures using additional linguistic elements such as adjectives and more sophisticated search of related words.

Local Prediction of Precipitation Based on Neural Network

Local Prediction of Precipitation Based on Neural Network Environmental Engineering 10th International Conference eissn 2029-7092 / eisbn 978-609-476-044-0 Vilnius Gediminas Technical University Lithuania, 27 28 April 2017 Article ID: enviro.2017.079 http://enviro.vgtu.lt

More information

Text Mining. Dr. Yanjun Li. Associate Professor. Department of Computer and Information Sciences Fordham University

Text Mining. Dr. Yanjun Li. Associate Professor. Department of Computer and Information Sciences Fordham University Text Mining Dr. Yanjun Li Associate Professor Department of Computer and Information Sciences Fordham University Outline Introduction: Data Mining Part One: Text Mining Part Two: Preprocessing Text Data

More information

To Predict Rain Fall in Desert Area of Rajasthan Using Data Mining Techniques

To Predict Rain Fall in Desert Area of Rajasthan Using Data Mining Techniques To Predict Rain Fall in Desert Area of Rajasthan Using Data Mining Techniques Peeyush Vyas Asst. Professor, CE/IT Department of Vadodara Institute of Engineering, Vadodara Abstract: Weather forecasting

More information

Application of Text Mining for Faster Weather Forecasting

Application of Text Mining for Faster Weather Forecasting International Journal of Computer Engineering and Information Technology VOL. 8, NO. 11, November 2016, 213 219 Available online at: www.ijceit.org E-ISSN 2412-8856 (Online) Application of Text Mining

More information

Possible Applications of Deep Neural Networks in Climate and Weather. David M. Hall Assistant Research Professor Dept. Computer Science, CU Boulder

Possible Applications of Deep Neural Networks in Climate and Weather. David M. Hall Assistant Research Professor Dept. Computer Science, CU Boulder Possible Applications of Deep Neural Networks in Climate and Weather David M. Hall Assistant Research Professor Dept. Computer Science, CU Boulder Quick overview of climate and weather models Weather models

More information

WMO Aeronautical Meteorology Scientific Conference 2017

WMO Aeronautical Meteorology Scientific Conference 2017 Session 1 Science underpinning meteorological observations, forecasts, advisories and warnings 1.6 Observation, nowcast and forecast of future needs 1.6.1 Advances in observing methods and use of observations

More information

MODELLING ENERGY DEMAND FORECASTING USING NEURAL NETWORKS WITH UNIVARIATE TIME SERIES

MODELLING ENERGY DEMAND FORECASTING USING NEURAL NETWORKS WITH UNIVARIATE TIME SERIES MODELLING ENERGY DEMAND FORECASTING USING NEURAL NETWORKS WITH UNIVARIATE TIME SERIES S. Cankurt 1, M. Yasin 2 1&2 Ishik University Erbil, Iraq 1 s.cankurt@ishik.edu.iq, 2 m.yasin@ishik.edu.iq doi:10.23918/iec2018.26

More information

Hierarchical models for the rainfall forecast DATA MINING APPROACH

Hierarchical models for the rainfall forecast DATA MINING APPROACH Hierarchical models for the rainfall forecast DATA MINING APPROACH Thanh-Nghi Do dtnghi@cit.ctu.edu.vn June - 2014 Introduction Problem large scale GCM small scale models Aim Statistical downscaling local

More information

International Journal of Advance Engineering and Research Development. Review Paper On Weather Forecast Using cloud Computing Technique

International Journal of Advance Engineering and Research Development. Review Paper On Weather Forecast Using cloud Computing Technique Scientific Journal of Impact Factor (SJIF): 4.72 International Journal of Advance Engineering and Research Development Volume 4, Issue 12, December -2017 e-issn (O): 2348-4470 p-issn (P): 2348-6406 Review

More information

A Neural Network Model for Surface Air Temperature Estimation over the Eastern Part of Thailand in 2004

A Neural Network Model for Surface Air Temperature Estimation over the Eastern Part of Thailand in 2004 A Neural Networ Model for Surface Air Temperature Estimation over the Eastern Part of Thailand in 2004 Wattana Kanbua 1*, Montri Inthachot 2 1 Marine Meteorological Center, Thai Meteorological Department,

More information

Gaussian Models

Gaussian Models Gaussian Models ddebarr@uw.edu 2016-04-28 Agenda Introduction Gaussian Discriminant Analysis Inference Linear Gaussian Systems The Wishart Distribution Inferring Parameters Introduction Gaussian Density

More information

Plan for operational nowcasting system implementation in Pulkovo airport (St. Petersburg, Russia)

Plan for operational nowcasting system implementation in Pulkovo airport (St. Petersburg, Russia) Plan for operational nowcasting system implementation in Pulkovo airport (St. Petersburg, Russia) Pulkovo airport (St. Petersburg, Russia) is one of the biggest airports in the Russian Federation (150

More information

CHAPTER 2: DATA MINING - A MODERN TOOL FOR ANALYSIS. Due to elements of uncertainty many problems in this world appear to be

CHAPTER 2: DATA MINING - A MODERN TOOL FOR ANALYSIS. Due to elements of uncertainty many problems in this world appear to be 11 CHAPTER 2: DATA MINING - A MODERN TOOL FOR ANALYSIS Due to elements of uncertainty many problems in this world appear to be complex. The uncertainty may be either in parameters defining the problem

More information

Short Term Load Forecasting Using Multi Layer Perceptron

Short Term Load Forecasting Using Multi Layer Perceptron International OPEN ACCESS Journal Of Modern Engineering Research (IJMER) Short Term Load Forecasting Using Multi Layer Perceptron S.Hema Chandra 1, B.Tejaswini 2, B.suneetha 3, N.chandi Priya 4, P.Prathima

More information

Will it be a Good Ski Season? Correlation between El Niño and U.S. Weather

Will it be a Good Ski Season? Correlation between El Niño and U.S. Weather Will it be a Good Ski Season? Correlation between El Niño and U.S. Weather Lillian Petersen 1 Abstract This study investigates the correlation between El Niño and the weather across the U.S. El Niño is

More information

ARTIFICIAL NEURAL NETWORK PART I HANIEH BORHANAZAD

ARTIFICIAL NEURAL NETWORK PART I HANIEH BORHANAZAD ARTIFICIAL NEURAL NETWORK PART I HANIEH BORHANAZAD WHAT IS A NEURAL NETWORK? The simplest definition of a neural network, more properly referred to as an 'artificial' neural network (ANN), is provided

More information

LAB 19. Lab 19. Differences in Regional Climate: Why Do Two Cities Located at the Same Latitude and Near a Body of Water Have Such Different Climates?

LAB 19. Lab 19. Differences in Regional Climate: Why Do Two Cities Located at the Same Latitude and Near a Body of Water Have Such Different Climates? Lab Handout Lab 19. Differences in Regional Climate: Why Do Two Cities Located at the Same Latitude and Near a Body of Water Have Such Different Climates? Introduction Weather describes the current atmospheric

More information

Data and prognosis for renewable energy

Data and prognosis for renewable energy The Hong Kong Polytechnic University Department of Electrical Engineering Project code: FYP_27 Data and prognosis for renewable energy by Choi Man Hin 14072258D Final Report Bachelor of Engineering (Honours)

More information

David John Gagne II, NCAR

David John Gagne II, NCAR The Performance Impacts of Machine Learning Design Choices for Gridded Solar Irradiance Forecasting Features work from Evaluating Statistical Learning Configurations for Gridded Solar Irradiance Forecasting,

More information

Nonlinear Classification

Nonlinear Classification Nonlinear Classification INFO-4604, Applied Machine Learning University of Colorado Boulder October 5-10, 2017 Prof. Michael Paul Linear Classification Most classifiers we ve seen use linear functions

More information

Application and verification of ECMWF products 2012

Application and verification of ECMWF products 2012 Application and verification of ECMWF products 2012 Instituto Português do Mar e da Atmosfera, I.P. (IPMA) 1. Summary of major highlights ECMWF products are used as the main source of data for operational

More information

Weather Prediction Using Historical Data

Weather Prediction Using Historical Data Weather Prediction Using Historical Data COMP 381 Project Report Michael Smith 1. Problem Statement Weather prediction is a useful tool for informing populations of expected weather conditions. Weather

More information

Thunderstorm Forecasting by using Artificial Neural Network

Thunderstorm Forecasting by using Artificial Neural Network Thunderstorm Forecasting by using Artificial Neural Network N.F Nik Ismail, D. Johari, A.F Ali, Faculty of Electrical Engineering Universiti Teknologi MARA 40450 Shah Alam Malaysia nikfasdi@yahoo.com.my

More information

Rainfall Prediction using Back-Propagation Feed Forward Network

Rainfall Prediction using Back-Propagation Feed Forward Network Rainfall Prediction using Back-Propagation Feed Forward Network Ankit Chaturvedi Department of CSE DITMR (Faridabad) MDU Rohtak (hry). ABSTRACT Back propagation is most widely used in neural network projects

More information

The Colorado Agricultural no Meteorological Network (CoAgMet) and Crop ET Reports

The Colorado Agricultural no Meteorological Network (CoAgMet) and Crop ET Reports C R O P S E R I E S Irrigation Quick Facts The Colorado Agricultural no. 4.723 Meteorological Network (CoAgMet) and Crop ET Reports A.A. Andales, T. A. Bauder and N. J. Doesken 1 (10/09) CoAgMet is a network

More information

Use the terms from the following list to complete the sentences below. Each term may be used only once.

Use the terms from the following list to complete the sentences below. Each term may be used only once. Skills Worksheet Directed Reading Section: Air Masses Use the terms from the following list to complete the sentences below. Each term may be used only once. high pressure poles low pressure equator wind

More information

Application and verification of ECMWF products at the Finnish Meteorological Institute

Application and verification of ECMWF products at the Finnish Meteorological Institute Application and verification of ECMWF products 2010 2011 at the Finnish Meteorological Institute by Juhana Hyrkkènen, Ari-Juhani Punkka, Henri Nyman and Janne Kauhanen 1. Summary of major highlights ECMWF

More information

The next generation in weather radar software.

The next generation in weather radar software. The next generation in weather radar software. PUBLISHED BY Vaisala Oyj Phone (int.): +358 9 8949 1 P.O. Box 26 Fax: +358 9 8949 2227 FI-00421 Helsinki Finland Try IRIS Focus at iris.vaisala.com. Vaisala

More information

7 - DE Website Document Weather Meteorology

7 - DE Website Document Weather Meteorology Date: Subject: 7 - DE Website Document Weather Meteorology Name: Section: E.ES.07.71 - Compare and contrast the difference and relationship between climate and weather. E.ES.07.74 - Describe weather conditions

More information

Weather Unit Part 2: Meteorology

Weather Unit Part 2: Meteorology Name: Earth Science Date: Period: Weather Unit Part 2: Meteorology 1 The map below shows North America and its surrounding bodies of water. Country borders are shown. On the map, label the following locations:

More information

Application and verification of ECMWF products: 2010

Application and verification of ECMWF products: 2010 Application and verification of ECMWF products: 2010 Hellenic National Meteorological Service (HNMS) F. Gofa, D. Tzeferi and T. Charantonis 1. Summary of major highlights In order to determine the quality

More information

MESOSCALE MODELLING OVER AREAS CONTAINING HEAT ISLANDS. Marke Hongisto Finnish Meteorological Institute, P.O.Box 503, Helsinki

MESOSCALE MODELLING OVER AREAS CONTAINING HEAT ISLANDS. Marke Hongisto Finnish Meteorological Institute, P.O.Box 503, Helsinki MESOSCALE MODELLING OVER AREAS CONTAINING HEAT ISLANDS Marke Hongisto Finnish Meteorological Institute, P.O.Box 503, 00101 Helsinki INTRODUCTION Urban heat islands have been suspected as being partially

More information

Retrieval of Cloud Top Pressure

Retrieval of Cloud Top Pressure Master Thesis in Statistics and Data Mining Retrieval of Cloud Top Pressure Claudia Adok Division of Statistics and Machine Learning Department of Computer and Information Science Linköping University

More information

Introduction to ML. Two examples of Learners: Naïve Bayesian Classifiers Decision Trees

Introduction to ML. Two examples of Learners: Naïve Bayesian Classifiers Decision Trees Introduction to ML Two examples of Learners: Naïve Bayesian Classifiers Decision Trees Why Bayesian learning? Probabilistic learning: Calculate explicit probabilities for hypothesis, among the most practical

More information

DOWNLOAD PDF READING CLIMATE MAPS

DOWNLOAD PDF READING CLIMATE MAPS Chapter 1 : Template:Climate chart/how to read a climate chart - Wikipedia Maps don't just tell you which way to go they can tell you practically everything about an area of land, even the weather. Learn

More information

Remote Ground based observations Merging Method For Visibility and Cloud Ceiling Assessment During the Night Using Data Mining Algorithms

Remote Ground based observations Merging Method For Visibility and Cloud Ceiling Assessment During the Night Using Data Mining Algorithms Remote Ground based observations Merging Method For Visibility and Cloud Ceiling Assessment During the Night Using Data Mining Algorithms Driss BARI Direction de la Météorologie Nationale Casablanca, Morocco

More information

CHAPTER 1: INTRODUCTION

CHAPTER 1: INTRODUCTION CHAPTER 1: INTRODUCTION There is now unequivocal evidence from direct observations of a warming of the climate system (IPCC, 2007). Despite remaining uncertainties, it is now clear that the upward trend

More information

Classification with Perceptrons. Reading:

Classification with Perceptrons. Reading: Classification with Perceptrons Reading: Chapters 1-3 of Michael Nielsen's online book on neural networks covers the basics of perceptrons and multilayer neural networks We will cover material in Chapters

More information

Naive Bayes classification

Naive Bayes classification Naive Bayes classification Christos Dimitrakakis December 4, 2015 1 Introduction One of the most important methods in machine learning and statistics is that of Bayesian inference. This is the most fundamental

More information

A REVIEW ARTICLE ON NAIVE BAYES CLASSIFIER WITH VARIOUS SMOOTHING TECHNIQUES

A REVIEW ARTICLE ON NAIVE BAYES CLASSIFIER WITH VARIOUS SMOOTHING TECHNIQUES Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 3, Issue. 10, October 2014,

More information

Application and verification of ECMWF products 2017

Application and verification of ECMWF products 2017 Application and verification of ECMWF products 2017 Finnish Meteorological Institute compiled by Weather and Safety Centre with help of several experts 1. Summary of major highlights FMI s forecasts are

More information

Chapter 12: Meteorology

Chapter 12: Meteorology Chapter 12: Meteorology Section 1: The Causes of Weather 1. Compare and contrast weather and climate. 2. Analyze how imbalances in the heating of Earth s surface create weather. 3. Describe how and where

More information

Decision T ree Tree Algorithm Week 4 1

Decision T ree Tree Algorithm Week 4 1 Decision Tree Algorithm Week 4 1 Team Homework Assignment #5 Read pp. 105 117 of the text book. Do Examples 3.1, 3.2, 3.3 and Exercise 3.4 (a). Prepare for the results of the homework assignment. Due date

More information

ECML PKDD Discovery Challenges 2017

ECML PKDD Discovery Challenges 2017 ECML PKDD Discovery Challenges 2017 Roberto Corizzo 1 and Dino Ienco 2 1 Department of Computer Science, University of Bari Aldo Moro, Bari, Italy roberto.corizzo@uniba.it 2 Irstea, UMR TETIS, Univ. Montpellier,

More information

Module 11: Meteorology Topic 5 Content: Weather Maps Notes

Module 11: Meteorology Topic 5 Content: Weather Maps Notes Introduction A variety of weather maps are produced by the National Weather Service and National Oceanographic Atmospheric Administration. These maps are used to help meteorologists accurately predict

More information

3.5 CLOUDS OBJECTIVES

3.5 CLOUDS OBJECTIVES 3.5 1 3.5 CLOUDS OBJECTIVES Identify stratus and cumulus clouds. Estimate and record the daily amount of cloud cover. Hypothesize about temperature in a cloud s shadow and in sunlight. Measure temperatures

More information

Prediction of Hourly Solar Radiation in Amman-Jordan by Using Artificial Neural Networks

Prediction of Hourly Solar Radiation in Amman-Jordan by Using Artificial Neural Networks Int. J. of Thermal & Environmental Engineering Volume 14, No. 2 (2017) 103-108 Prediction of Hourly Solar Radiation in Amman-Jordan by Using Artificial Neural Networks M. A. Hamdan a*, E. Abdelhafez b

More information

1 What Is Climate? TAKE A LOOK 2. Explain Why do areas near the equator tend to have high temperatures?

1 What Is Climate? TAKE A LOOK 2. Explain Why do areas near the equator tend to have high temperatures? CHAPTER 17 1 What Is Climate? SECTION Climate BEFORE YOU READ After you read this section, you should be able to answer these questions: What is climate? What factors affect climate? How do climates differ

More information

Prediction of Monthly Rainfall of Nainital Region using Artificial Neural Network (ANN) and Support Vector Machine (SVM)

Prediction of Monthly Rainfall of Nainital Region using Artificial Neural Network (ANN) and Support Vector Machine (SVM) Vol- Issue-3 25 Prediction of ly of Nainital Region using Artificial Neural Network (ANN) and Support Vector Machine (SVM) Deepa Bisht*, Mahesh C Joshi*, Ashish Mehta** *Department of Mathematics **Department

More information

AREP GAW. AQ Forecasting

AREP GAW. AQ Forecasting AQ Forecasting What Are We Forecasting Averaging Time (3 of 3) PM10 Daily Maximum Values, 2001 Santiago, Chile (MACAM stations) 300 Level 2 Pre-Emergency Level 1 Alert 200 Air Quality Standard 150 100

More information

Study on Links between Cerebral Infarction and Climate Change Based on Hidden Markov Models

Study on Links between Cerebral Infarction and Climate Change Based on Hidden Markov Models International Journal of Social Science Studies Vol. 3, No. 5; September 2015 ISSN 2324-8033 E-ISSN 2324-8041 Published by Redfame Publishing URL: http://ijsss.redfame.com Study on Links between Cerebral

More information

Description of the case study

Description of the case study Description of the case study During the night and early morning of the 14 th of July 011 the significant cloud layer expanding in the West of the country and slowly moving East produced precipitation

More information

ANN and Statistical Theory Based Forecasting and Analysis of Power System Variables

ANN and Statistical Theory Based Forecasting and Analysis of Power System Variables ANN and Statistical Theory Based Forecasting and Analysis of Power System Variables Sruthi V. Nair 1, Poonam Kothari 2, Kushal Lodha 3 1,2,3 Lecturer, G. H. Raisoni Institute of Engineering & Technology,

More information

A Support Vector Regression Model for Forecasting Rainfall

A Support Vector Regression Model for Forecasting Rainfall A Support Vector Regression for Forecasting Nasimul Hasan 1, Nayan Chandra Nath 1, Risul Islam Rasel 2 Department of Computer Science and Engineering, International Islamic University Chittagong, Bangladesh

More information

SHORT COMMUNICATION EXPLORING THE RELATIONSHIP BETWEEN THE NORTH ATLANTIC OSCILLATION AND RAINFALL PATTERNS IN BARBADOS

SHORT COMMUNICATION EXPLORING THE RELATIONSHIP BETWEEN THE NORTH ATLANTIC OSCILLATION AND RAINFALL PATTERNS IN BARBADOS INTERNATIONAL JOURNAL OF CLIMATOLOGY Int. J. Climatol. 6: 89 87 (6) Published online in Wiley InterScience (www.interscience.wiley.com). DOI:./joc. SHORT COMMUNICATION EXPLORING THE RELATIONSHIP BETWEEN

More information

Name: Regents Earth Science. Weather and Climate - Review Questions

Name: Regents Earth Science. Weather and Climate - Review Questions Name: Regents Earth Science Weather and Climate - Review Questions Date: Thayer - 279 1. People sometimes release substances into the atmosphere to increase the probability of rain by A) raising the air

More information

8.6 Bayesian neural networks (BNN) [Book, Sect. 6.7]

8.6 Bayesian neural networks (BNN) [Book, Sect. 6.7] 8.6 Bayesian neural networks (BNN) [Book, Sect. 6.7] While cross-validation allows one to find the weight penalty parameters which would give the model good generalization capability, the separation of

More information

Integrated Electricity Demand and Price Forecasting

Integrated Electricity Demand and Price Forecasting Integrated Electricity Demand and Price Forecasting Create and Evaluate Forecasting Models The many interrelated factors which influence demand for electricity cannot be directly modeled by closed-form

More information

National Climatic Data Center Data Management Issues Tom Karl Director, NOAA s National Climatic Data Center

National Climatic Data Center Data Management Issues Tom Karl Director, NOAA s National Climatic Data Center National Climatic Data Center Data Management Issues Tom Karl Director, NOAA s National Climatic Data Center Opening Meeting NOAA Science Advisory Board s Data Archiving and Access Requirements Working

More information

AN INTERNATIONAL SOLAR IRRADIANCE DATA INGEST SYSTEM FOR FORECASTING SOLAR POWER AND AGRICULTURAL CROP YIELDS

AN INTERNATIONAL SOLAR IRRADIANCE DATA INGEST SYSTEM FOR FORECASTING SOLAR POWER AND AGRICULTURAL CROP YIELDS AN INTERNATIONAL SOLAR IRRADIANCE DATA INGEST SYSTEM FOR FORECASTING SOLAR POWER AND AGRICULTURAL CROP YIELDS James Hall JHTech PO Box 877 Divide, CO 80814 Email: jameshall@jhtech.com Jeffrey Hall JHTech

More information

The next generation in weather radar software.

The next generation in weather radar software. The next generation in weather radar software. PUBLISHED BY Vaisala Oyj Phone (int.): +358 9 8949 1 P.O. Box 26 Fax: +358 9 8949 2227 FI-00421 Helsinki Finland Try IRIS Focus at iris.vaisala.com. Vaisala

More information

Prediction of Citations for Academic Papers

Prediction of Citations for Academic Papers 000 001 002 003 004 005 006 007 008 009 010 011 012 013 014 015 016 017 018 019 020 021 022 023 024 025 026 027 028 029 030 031 032 033 034 035 036 037 038 039 040 041 042 043 044 045 046 047 048 049 050

More information

Areas of the World with high Insolation

Areas of the World with high Insolation Areas of the World with high Insolation 22 /43 Africa Annual Average Global Horizontal Solar Radiation Map 19 Jul 2006 A map depicting model estimates of monthly average daily total radiation using inputs

More information

Spatial Role Labeling CS365 Course Project

Spatial Role Labeling CS365 Course Project Spatial Role Labeling CS365 Course Project Amit Kumar, akkumar@iitk.ac.in Chandra Sekhar, gchandra@iitk.ac.in Supervisor : Dr.Amitabha Mukerjee ABSTRACT In natural language processing one of the important

More information

The Global Scope of Climate. The Global Scope of Climate. Keys to Climate. Chapter 8

The Global Scope of Climate. The Global Scope of Climate. Keys to Climate. Chapter 8 The Global Scope of Climate Chapter 8 The Global Scope of Climate In its most general sense, climate is the average weather of a region, but except where conditions change very little during the course

More information

Muhammad Noor* & Tarmizi Ismail

Muhammad Noor* & Tarmizi Ismail Malaysian Journal of Civil Engineering 30(1):13-22 (2018) DOWNSCALING OF DAILY AVERAGE RAINFALL OF KOTA BHARU KELANTAN, MALAYSIA Muhammad Noor* & Tarmizi Ismail Department of Hydraulic and Hydrology, Faculty

More information

Analysis of Data Mining Techniques for Weather Prediction

Analysis of Data Mining Techniques for Weather Prediction ISSN (Print) : 0974-6846 ISSN (Online) : 0974-5645 Indian Journal of Science and Technology, Vol 9(38), DOI: 10.17485/ijst/2016/v9i38/101962, October 2016 Analysis of Data Mining Techniques for Weather

More information

Numerical Learning Algorithms

Numerical Learning Algorithms Numerical Learning Algorithms Example SVM for Separable Examples.......................... Example SVM for Nonseparable Examples....................... 4 Example Gaussian Kernel SVM...............................

More information

DM-Group Meeting. Subhodip Biswas 10/16/2014

DM-Group Meeting. Subhodip Biswas 10/16/2014 DM-Group Meeting Subhodip Biswas 10/16/2014 Papers to be discussed 1. Crowdsourcing Land Use Maps via Twitter Vanessa Frias-Martinez and Enrique Frias-Martinez in KDD 2014 2. Tracking Climate Change Opinions

More information

QualiMET 2.0. The new Quality Control System of Deutscher Wetterdienst

QualiMET 2.0. The new Quality Control System of Deutscher Wetterdienst QualiMET 2.0 The new Quality Control System of Deutscher Wetterdienst Reinhard Spengler Deutscher Wetterdienst Department Observing Networks and Data Quality Assurance of Meteorological Data Michendorfer

More information

CSE 417T: Introduction to Machine Learning. Final Review. Henry Chai 12/4/18

CSE 417T: Introduction to Machine Learning. Final Review. Henry Chai 12/4/18 CSE 417T: Introduction to Machine Learning Final Review Henry Chai 12/4/18 Overfitting Overfitting is fitting the training data more than is warranted Fitting noise rather than signal 2 Estimating! "#$

More information

Global Climates. Name Date

Global Climates. Name Date Global Climates Name Date No investigation of the atmosphere is complete without examining the global distribution of the major atmospheric elements and the impact that humans have on weather and climate.

More information

Algorithms for Classification: The Basic Methods

Algorithms for Classification: The Basic Methods Algorithms for Classification: The Basic Methods Outline Simplicity first: 1R Naïve Bayes 2 Classification Task: Given a set of pre-classified examples, build a model or classifier to classify new cases.

More information

CSC Neural Networks. Perceptron Learning Rule

CSC Neural Networks. Perceptron Learning Rule CSC 302 1.5 Neural Networks Perceptron Learning Rule 1 Objectives Determining the weight matrix and bias for perceptron networks with many inputs. Explaining what a learning rule is. Developing the perceptron

More information

Put the Weather to Work for Your Company

Put the Weather to Work for Your Company SAP Data Network Put the Weather to Work for Your Company Extend the Value of Your Business and Transactional Solutions by Incorporating Weather Data 1 / 7 Table of Contents 3 Enrich Business Data with

More information

Trends Forecasting. Overview: Objectives: GLEs Addressed: Materials: Activity Procedure:

Trends Forecasting. Overview: Objectives: GLEs Addressed: Materials: Activity Procedure: Trends Forecasting Overview: How do meteorologists forecast weather? Students learn one of 5 main methods for forecasting weather, use it to create a forecast, and analyze results. Objectives: The student

More information

Homogenization of the Hellenic cloud amount time series

Homogenization of the Hellenic cloud amount time series Homogenization of the Hellenic cloud amount time series A Argiriou 1, A Mamara 2, E Dimadis 1 1 Laboratory of Atmospheric Physics, 2 Hellenic Meteorological Service October 19, 2017 A Argiriou 1, A Mamara

More information

25.1 Air Masses. Section 25.1 Objectives

25.1 Air Masses. Section 25.1 Objectives Section 25.1 Objectives Explain how an air mass forms. List the four main types of air masses. Describe how air masses affect the weather of North America. Air Masses 25.1 Air Masses Differences in air

More information

What is Climate? Understanding and predicting climatic changes are the basic goals of climatology.

What is Climate? Understanding and predicting climatic changes are the basic goals of climatology. What is Climate? Understanding and predicting climatic changes are the basic goals of climatology. Climatology is the study of Earth s climate and the factors that affect past, present, and future climatic

More information

The Meteorological Observatory from Neumayer Gert König-Langlo, Bernd Loose Alfred-Wegener-Institut, Bremerhaven, Germany

The Meteorological Observatory from Neumayer Gert König-Langlo, Bernd Loose Alfred-Wegener-Institut, Bremerhaven, Germany The Meteorological Observatory from Neumayer Gert König-Langlo, Bernd Loose Alfred-Wegener-Institut, Bremerhaven, Germany History of Neumayer In March 1981, the Georg von Neumayer Station (70 37 S, 8 22

More information

Some details about the theoretical background of CarpatClim DanubeClim gridded databases and their practical consequences

Some details about the theoretical background of CarpatClim DanubeClim gridded databases and their practical consequences Some details about the theoretical background of CarpatClim DanubeClim gridded databases and their practical consequences Zita Bihari, Tamás Szentimrey, Andrea Kircsi Hungarian Meteorological Service Outline

More information

A Novel Method for Predicting the Power Output of Distributed Renewable Energy Resources

A Novel Method for Predicting the Power Output of Distributed Renewable Energy Resources A Novel Method for Predicting the Power Output of Distributed Renewable Energy Resources Athanasios Aris Panagopoulos1 Supervisor: Georgios Chalkiadakis1 Technical University of Crete, Greece A thesis

More information

Appendix 1: UK climate projections

Appendix 1: UK climate projections Appendix 1: UK climate projections The UK Climate Projections 2009 provide the most up-to-date estimates of how the climate may change over the next 100 years. They are an invaluable source of information

More information

SPATIAL DATA MINING. Ms. S. Malathi, Lecturer in Computer Applications, KGiSL - IIM

SPATIAL DATA MINING. Ms. S. Malathi, Lecturer in Computer Applications, KGiSL - IIM SPATIAL DATA MINING Ms. S. Malathi, Lecturer in Computer Applications, KGiSL - IIM INTRODUCTION The main difference between data mining in relational DBS and in spatial DBS is that attributes of the neighbors

More information

UNIT 1. WEATHER AND CLIMATE. PRIMARY 4/ Social Science Pedro Antonio López Hernández

UNIT 1. WEATHER AND CLIMATE. PRIMARY 4/ Social Science Pedro Antonio López Hernández UNIT 1. WEATHER AND CLIMATE PRIMARY 4/ Social Science Pedro Antonio López Hernández LAYERS OF THE ATMOSPHERE The atmosphere is a mixture of gases that surround Earth and separate it from the rest of the

More information

Use of lightning data to improve observations for aeronautical activities

Use of lightning data to improve observations for aeronautical activities Use of lightning data to improve observations for aeronautical activities Françoise Honoré Jean-Marc Yvagnes Patrick Thomas Météo_France Toulouse France I Introduction Aeronautical activities are very

More information

LAB J - WORLD CLIMATE ZONES

LAB J - WORLD CLIMATE ZONES Introduction LAB J - WORLD CLIMATE ZONES The objective of this lab is to familiarize the student with the various climates around the world and the climate controls that influence these climates. Students

More information

VALIDATION RESULTS OF THE OPERATIONAL LSA-SAF SNOW COVER MAPPING

VALIDATION RESULTS OF THE OPERATIONAL LSA-SAF SNOW COVER MAPPING VALIDATION RESULTS OF THE OPERATIONAL LSA-SAF SNOW COVER MAPPING Niilo Siljamo, Otto Hyvärinen Finnish Meteorological Institute, Erik Palménin aukio 1, P.O.Box 503, FI-00101 HELSINKI Abstract Hydrological

More information

Annual WWW Technical Progress Report. On the Global Data Processing and Forecasting System 2004 OMAN

Annual WWW Technical Progress Report. On the Global Data Processing and Forecasting System 2004 OMAN Annual WWW Technical Progress Report On the Global Data Processing and Forecasting System 2004 OMAN Summary of highlights Oman Meteorological Service, located in Muscat, Oman develops, produces and disseminates

More information

Mining Classification Knowledge

Mining Classification Knowledge Mining Classification Knowledge Remarks on NonSymbolic Methods JERZY STEFANOWSKI Institute of Computing Sciences, Poznań University of Technology SE lecture revision 2013 Outline 1. Bayesian classification

More information

Click Prediction and Preference Ranking of RSS Feeds

Click Prediction and Preference Ranking of RSS Feeds Click Prediction and Preference Ranking of RSS Feeds 1 Introduction December 11, 2009 Steven Wu RSS (Really Simple Syndication) is a family of data formats used to publish frequently updated works. RSS

More information

Leveraging Sentinel-1 time-series data for mapping agricultural land cover and land use in the tropics

Leveraging Sentinel-1 time-series data for mapping agricultural land cover and land use in the tropics Leveraging Sentinel-1 time-series data for mapping agricultural land cover and land use in the tropics Caitlin Kontgis caitlin@descarteslabs.com @caitlinkontgis Descartes Labs Overview What is Descartes

More information

SuperPack North America

SuperPack North America SuperPack North America Speedwell SuperPack makes available an unprecedented range of quality historical weather data, and weather data feeds for a single annual fee. SuperPack dramatically simplifies

More information

Multi-Plant Photovoltaic Energy Forecasting Challenge with Regression Tree Ensembles and Hourly Average Forecasts

Multi-Plant Photovoltaic Energy Forecasting Challenge with Regression Tree Ensembles and Hourly Average Forecasts Multi-Plant Photovoltaic Energy Forecasting Challenge with Regression Tree Ensembles and Hourly Average Forecasts Kathrin Bujna 1 and Martin Wistuba 2 1 Paderborn University 2 IBM Research Ireland Abstract.

More information

PRICING AND PROBABILITY DISTRIBUTIONS OF ATMOSPHERIC VARIABLES

PRICING AND PROBABILITY DISTRIBUTIONS OF ATMOSPHERIC VARIABLES PRICING AND PROBABILITY DISTRIBUTIONS OF ATMOSPHERIC VARIABLES TECHNICAL WHITE PAPER WILLIAM M. BRIGGS Abstract. Current methods of assessing the probability distributions of atmospheric variables are

More information

Application and verification of ECMWF products 2011

Application and verification of ECMWF products 2011 Application and verification of ECMWF products 2011 National Meteorological Administration 1. Summary of major highlights Medium range weather forecasts are primarily based on the results of ECMWF and

More information

The Perceptron. Volker Tresp Summer 2016

The Perceptron. Volker Tresp Summer 2016 The Perceptron Volker Tresp Summer 2016 1 Elements in Learning Tasks Collection, cleaning and preprocessing of training data Definition of a class of learning models. Often defined by the free model parameters

More information

COMPARING PERFORMANCE OF NEURAL NETWORKS RECOGNIZING MACHINE GENERATED CHARACTERS

COMPARING PERFORMANCE OF NEURAL NETWORKS RECOGNIZING MACHINE GENERATED CHARACTERS Proceedings of the First Southern Symposium on Computing The University of Southern Mississippi, December 4-5, 1998 COMPARING PERFORMANCE OF NEURAL NETWORKS RECOGNIZING MACHINE GENERATED CHARACTERS SEAN

More information

Decision Support. Dr. Johan Hagelbäck.

Decision Support. Dr. Johan Hagelbäck. Decision Support Dr. Johan Hagelbäck johan.hagelback@lnu.se http://aiguy.org Decision Support One of the earliest AI problems was decision support The first solution to this problem was expert systems

More information

Data Mining Part 4. Prediction

Data Mining Part 4. Prediction Data Mining Part 4. Prediction 4.3. Fall 2009 Instructor: Dr. Masoud Yaghini Outline Introduction Bayes Theorem Naïve References Introduction Bayesian classifiers A statistical classifiers Introduction

More information