STATISTICAL ANALYSIS OF WIND AND RAINFALL WITH FUNCTIONAL DATA ANALYSIS TECHNIQUE WAN NORLIYANA BT WAN ISMAIL

Similar documents
A STUDY ON THE CHARACTERISTICS OF RAINFALL DATA AND ITS PARAMETER ESTIMATES JAYANTI A/P ARUMUGAM UNIVERSITI TEKNOLOGI MALAYSIA

Functional Data Analysis Technique on Daily Rainfall Data: A Case Study at North Region of Peninsular Malaysia.

DETECTION OF STRUCTURAL DEFORMATION FROM 3D POINT CLOUDS JONATHAN NYOKA CHIVATSI UNIVERSITI TEKNOLOGI MALAYSIA

MONTE CARLO SIMULATION OF NEUTRON RADIOGRAPHY 2 (NUR-2) SYSTEM AT TRIGA MARK II RESEARCH REACTOR OF MALAYSIAN NUCLEAR AGENCY

RAINFALL SERIES LIM TOU HIN

OPTIMIZATION OF CHROMIUM, NICKEL AND VANADIUM ANALYSIS IN CRUDE OIL USING GRAPHITE FURNACE ATOMIC ABSORPTION SPECTROSCOPY NURUL HANIS KAMARUDIN

MATHEMATICAL MODELLING OF UNSTEADY BIOMAGNETIC FLUID FLOW AND HEAT TRANSFER WITH GRAVITATIONAL ACCELERATION

INFLUENCES OF GROUNDWATER, RAINFALL, AND TIDES ON BEACH PROFILES CHANGES AT DESARU BEACH FARIZUL NIZAM BIN ABDULLAH

ARTIFICIAL NEURAL NETWORK AND KALMAN FILTER APPROACHES BASED ON ARIMA FOR DAILY WIND SPEED FORECASTING OSAMAH BASHEER SHUKUR

INDIRECT TENSION TEST OF HOT MIX ASPHALT AS RELATED TO TEMPERATURE CHANGES AND BINDER TYPES AKRIMA BINTI ABU BAKAR

COVRE OPTIMIZATION FOR IMAGE STEGANOGRAPHY BY USING IMAGE FEATURES ZAID NIDHAL KHUDHAIR

EVALUATION OF FUSION SCORE FOR FACE VERIFICATION SYSTEM REZA ARFA

ANOLYTE SOLUTION GENERATED FROM ELECTROCHEMICAL ACTIVATION PROCESS FOR THE TREATMENT OF PHENOL

ROOFTOP MAPPING AND INFORMATION SYSTEM FOR RAINWATER HARVESTING SURAYA BINTI SAMSUDIN UNIVERSITI TEKNOLOGI MALAYSIA

DYNAMIC SIMULATION OF COLUMNS CONSIDERING GEOMETRIC NONLINEARITY MOSTAFA MIRSHEKARI

FRAGMENT REWEIGHTING IN LIGAND-BASED VIRTUAL SCREENING ALI AHMED ALFAKIABDALLA ABDELRAHIM

OPTIMAL CONTROL BASED ON NONLINEAR CONJUGATE GRADIENT METHOD IN CARDIAC ELECTROPHYSIOLOGY NG KIN WEI UNIVERSITI TEKNOLOGI MALAYSIA

EFFECT OF GRAPHENE OXIDE (GO) IN IMPROVING THE PERFORMANCE OF HIGH VOLTAGE INSULATOR NUR FARAH AIN BINTI ISA UNIVERSITI TEKNOLOGI MALAYSIA

HYDROGEN RECOVERY FROM THE REFORMING GAS USING COMMERCIAL ACTIVATED CARBON MEHDI RAHMANIAN

CHARACTERISTICS OF SOLITARY WAVE IN FIBER BRAGG GRATING MARDIANA SHAHADATUL AINI BINTI ZAINUDIN UNIVERSITI TEKNOLOGI MALAYSIA

BOUNDARY INTEGRAL EQUATION WITH THE GENERALIZED NEUMANN KERNEL FOR COMPUTING GREEN S FUNCTION FOR MULTIPLY CONNECTED REGIONS

INDEX SELECTION ENGINE FOR SPATIAL DATABASE SYSTEM MARUTO MASSERIE SARDADI UNIVERSITI TEKNOLOGI MALAYSIA

EMPIRICAL STRENQTH ENVELOPE FOR SHALE NUR 'AIN BINTI MAT YUSOF

ROCK SHAFT RESISTANCE OF BORED PILES SOCKETED INTO DECOMPOSED MALAYSIA GRANITE RAJA SHAHROM NIZAM SHAH BIN RAJA SHOIB UNIVERSITI TEKNOLOGI MALAYSIA

SEDIMENTATION RATE AT SETIU LAGOON USING NATURAL. RADIOTRACER 210 Pb TECHNIQUE

EFFECTS OF Ni 3 Ti (DO 24 ) PRECIPITATES AND COMPOSITION ON Ni-BASED SUPERALLOYS USING MOLECULAR DYNAMICS METHOD

NUMERICAL SIMULATIONS ON THE EFFECTS OF USING VENTILATION FAN ON CONDITIONS OF AIR INSIDE A CAR PASSENGER COMPARTMENT INTAN SABARIAH BINTI SABRI

NUMERICAL INVESTIGATION OF TURBULENT NANOFLUID FLOW EFFECT ON ENHANCING HEAT TRANSFER IN STRAIGHT CHANNELS DHAFIR GIYATH JEHAD

A COMPUTATIONAL FLUID DYNAMIC FRAMEWORK FOR MODELING AND SIMULATION OF PROTON EXCHANGE MEMBRANE FUEL CELL HAMID KAZEMI ESFEH

FLOOD MAPPING OF NORTHERN PENINSULAR MALAYSIA USING SAR IMAGES HAFSAT SALEH DUTSENWAI

SYSTEM IDENTIFICATION MODEL AND PREDICTIVE FUNCTIONAL CONTROL OF AN ELECTRO-HYDRAULIC ACTUATOR SYSTEM NOOR HANIS IZZUDDIN BIN MAT LAZIM

SYNTHESIS AND CHARACTERIZATION OF POLYACRYLAMIDE BASED HYDROGEL CONTAINING MAGNESIUM OXIDE NANOPARTICLES FOR ANTIBACTERIAL APPLICATIONS

PRODUCTION OF POLYHYDROXYALKANATE (PHA) FROM WASTE COOKING OIL USING PSEUDOMONAS OLEOVORANS FARZANEH SABBAGH MOJAVERYAZDI

THE DEVELOPMENT OF PORTABLE SENSOR TO DETERMINE THE FRESHNESS AND QUALITY OF FRUITS USING CAPACITIVE TECHNIQUE LAI KAI LING

SHAPE-BASED TWO DIMENSIONAL DESCRIPTOR FOR SEARCHING MOLECULAR DATABASE

ISOLATION AND CHARACTERIZATION OF NANOCELLULOSE FROM EMPTY FRUIT BUNCH FIBER FOR NANOCOMPOSITE APPLICATION

WIND TUNNEL TEST TO INVESTIGATE TRANSITION TO TURBULENCE ON WIND TURBINE AIRFOIL MAHDI HOZHABRI NAMIN UNIVERSITI TEKNOLOGI MALAYSIA

ADSORPTION OF ARSENATE BY HEXADECYLPYRIDINIUM BROMIDE MODIFIED NATURAL ZEOLITE MOHD AMMARUL AFFIQ BIN MD BUANG UNIVERSITI TEKNOLOGI MALAYSIA

SOLAR RADIATION EQUATION OF TIME PUNITHA A/P MARIMUTHOO

UNIVERSITI PUTRA MALAYSIA

MULTISTAGE ARTIFICIAL NEURAL NETWORK IN STRUCTURAL DAMAGE DETECTION GOH LYN DEE

ERODABLE DAM BREACHING PATTERNS DUE TO OVERTOPPING NOR AIN BINTI MAT LAZIN UNIVERSITI TEKNOLOGI MALAYSIA

STABILITY AND SIMULATION OF A STANDING WAVE IN POROUS MEDIA LAU SIEW CHING UNIVERSTI TEKNOLOGI MALAYSIA

OPTICAL TWEEZER INDUCED BY MICRORING RESONATOR MUHAMMAD SAFWAN BIN ABD AZIZ

MODELING AND SIMULATION OF BILAYER GRAPHENE NANORIBBON FIELD EFFECT TRANSISTOR SEYED MAHDI MOUSAVI UNIVERSITI TEKNOLOGI MALAYSIA

ULTIMATE STRENGTH ANALYSIS OF SHIPS PLATE DUE TO CORROSION ZULFAQIH BIN LAZIM

AHMED MOKHTAR ALBSHIR BUDIEA

MODELING AND CONTROL OF A CLASS OF AERIAL ROBOTIC SYSTEMS TAN ENG TECK UNIVERSITI TEKNOLOGY MALAYSIA

COMMUTATIVITY DEGREES AND RELATED INVARIANTS OF SOME FINITE NILPOTENT GROUPS FADILA NORMAHIA BINTI ABD MANAF UNIVERSITI TEKNOLOGI MALAYSIA

MATHEMATICAL MODELING FOR TSUNAMI WAVES USING LATTICE BOLTZMANN METHOD SARA ZERGANI. UNIVERSITI TEKNOLOGI MALAYSIAi

POSITION CONTROL USING FUZZY-BASED CONTROLLER FOR PNEUMATIC-SERVO CYLINDER IN BALL AND BEAM APPLICATION MUHAMMAD ASYRAF BIN AZMAN

Nur Atikah binti Md.Nor

ELIMINATION OF RAINDROPS EFFECTS IN INFRARED SENSITIVE CAMERA AHMAD SHARMI BIN ABDULLAH

ONLINE LOCATION DETERMINATION OF SWITCHED CAPACITOR BANK IN DISTRIBUTION SYSTEM AHMED JAMAL NOORI AL-ANI

Regional Climate Observation & Simulation of Extreme Temperature & Precipitation Trends

DENSITY FUNCTIONAL THEORY SIMULATION OF MAGNETISM DUE TO ATOMIC VACANCIES IN GRAPHENE USING SIESTA

DEVELOPMENT OF GEODETIC DEFORMATION ANALYSIS SOFTWARE BASED ON ITERATIVE WEIGHTED SIMILARITY TRANSFORMATION TECHNIQUE ABDALLATEF A. M.

EFFECT OF ROCK MASS PROPERTIES ON SKIN FRICTION OF ROCK SOCKET. YUSLIZA BINTI ALIAS UNIVERSITI TEKNOLOGI MALAYSIA

COMPUTATIONAL STUDY OF PROTON TRANSFER IN RESTRICTED SULFONIC ACID FOR PROTON EXCHANGE MEMBRANE FUEL CELL SITI NADIAH BINTI MD AJEMAN

MECHANICAL PROPERTIES OF CALCIUM CARBONATE AND COCONUT SHELL FILLED POLYPROPYLENE COMPOSITES MEHDI HEIDARI UNIVERSITI TEKNOLOGI MALAYSIA

GENERATING MULTI-LEVEL OF DETAILS FOR THREE-DIMENSIONAL BUILDING MODELUSINGTERRESTRIAL LASER SCANNINGDATA RIZKA AKMALIA

REPORT ON HEAVY RAINFALL THAT CAUSED FLOODS IN JOHOR, MELAKA, NEGERI SEMBILAN AND PAHANG. DURING THE PERIOD 17 th 20 th DECEMBER 2006

ADSORPTION AND STRIPPING OF ETHANOL FROM AQUEOUS SOLUTION USING SEPABEADS207 ADSORBENT

Fitting Daily Rainfall Amount in Peninsular Malaysia Using Several Types of Exponential Distributions

NUMERICAL EXPERIMENT OF RADIATION SELF-ABSORPTION AND RADIATION DYNAMICS IN THE DENSE PLASMA FOCUS USING LEE MODEL ZAHRA ALI

DEVELOPMENT OF PROCESS-BASED ENTROPY MEASUREMENT FRAMEWORK FOR ORGANIZATIONS MAHMOOD OLYAIY

SPECTROSCOPIC PROPERTIES OF Nd:YAG LASER PUMPED BY FLASHLAMP AT VARIOUS TEMPERATURES AND INPUT ENERGIES SEYED EBRAHIM POURMAND

GROUPS OF ORDER AT MOST 24

SYNTHESIS AND CHARACTERIZATION OF MESO-SUBSTITUTED PORPHYRIN MUHAMMAD TAHIR MUHAMMAD

ENHANCED COPY-PASTE IMAGE FORGERY DETECTION BASED ON ARCHIMEDEAN SPIRAL AND COARSE-TO-FINE APPROACH MOHAMMAD AKBARPOUR SEKEH

BROYDEN S AND THOMAS METHODS FOR IDENTIFYING SINGULAR ROOTS IN NONLINER SYSTEMS IKKA AFIQAH BINTI AMIR

UNIVERSITI TEKNOLOGI MALAYSIA

UNSTEADY MAGNETOHYDRODYNAMICS FLOW OF A MICROPOLAR FLUID WITH HEAT AND MASS TRANSFER AURANGZAIB MANGI UNIVERSITI TEKNOLOGI MALAYSIA

CLASSIFICATION OF MULTIBEAM SNIPPETS DATA USING STATISTICAL ANALYSIS METHOD LAU KUM WENG UNIVERSITITEKNOLOGI MALAYSIA

AMPLIFICATION OF PARTIAL RICE FLORIGEN FROM MALAYSIAN UPLAND RICE CULTIVAR HITAM AND WAI ABDULRAHMAN MAHMOUD DOGARA UNIVERSITI TEKNOLOGI MALAYSIA

INTEGRATED MALAYSIAN METEOROLOGICAL DATA ATMOSPHERIC DISPERSION SOFTWARE FOR AIR POLLUTANT DISPERSION SIMULATION

A GENERALIZED POWER-LAW MODEL OF BLOOD FLOW THROUGH TAPERED ARTERIES WITH AN OVERLAPPING STENOSIS HUDA SALMI BINTI AHMAD

HIGH RESOLUTION DIGITAL ELEVATION MODEL GENERATION BY SEMI GLOBAL MATCHING APPROACH SITI MUNIRAH BINTI SHAFFIE

FOURIER TRANSFORM TECHNIQUE FOR ANALYTICAL SOLUTION OF DIFFUSION EQUATION OF CONCENTRATION SPHERICAL DROPS IN ROTATING DISC CONTACTOR COLUMN

SYNTHESIS AND CHARACTERIZATION OF SUPRAMOLECULAR POLYMER BASED ON LINOLEIC ACID OF SUNFLOWER OIL MILI PURBAYA UNIVERSITI TEKNOLOGI MALAYSIA

COMPUTATIONAL ANALYSIS OF HEAT TRANSFER RATE USING THERMAL RESPONSE FACTOR METHOD TENGKU FAKHRUZZAMAN BIN TENGKU YUSOFF

qobka=^k^ivpfp=clo=aolrdeq=bsbkq=fk=mbkfkpri^o= j^i^vpf^=

A COMPARISO OF MODAL FLEXIBILITY METHOD A D MODAL CURVATURE METHOD I STRUCTURAL DAMAGE DETECTIO OOR SABRI A ZAHARUDI

Introduction to Functional Data Analysis A CSCU Workshop. Giles Hooker Biological Statistics and Computational Biology

ROOT FINDING OF A SYSTEM OF NONLINEAR EQUATIONS USING THE COMBINATION OF NEWTON, CONJUGATE GRADIENT AND QUADRATURE METHODS

EFFECTS OF SURFACE ROUGHNESS USING DIFFERENT ELECTRODES ON ELECTRICAL DISCHARGE MACHINING (EDM) MOHD ABD LATIF BIN ABD GHANI

CORRELATION STUDY OF THE STRAIN AND VIBRATION SIGNALS OF A BEAM HAMIZATUN BINTI MOHD FAZI

MICROWAVE SYNTHESIS OF SODALITE FROM COAL FLY ASH AS SOLID BASE CATALYST FOR KNOEVENAGEL REACTION MOHD HILMI BIN MOHAMED

UTILIZING CLASSICAL SCALING FOR FAULT IDENTIFICATION BASED ON CONTINUOUS-BASED PROCESS MIRA SYAHIRAH BT ABD WAHIT

FINITE ELEMENT METHOD FOR TWO-DIMENSIONAL ELASTICITY PROBLEM HANIMAH OTHMAN

SPRAY DRIED PRODIGIOSIN FROM Serratia marcescens AS A FOOD COLORANT SHAHLA NAMAZKAR UNIVERSITI TEKNOLOGI MALAYSIA

SYNTHESIS AND CATALYTIC PERFORMANCE OF PALLADIUM(II) HYDRAZONE COMPLEXES IN SUZUKI-MIYAURA CROSS-COUPLING REACTION

EFFECT OF SOIL TYPE ON SEISMIC PERFROMANCE OF REINFORCED CONCRETE SCHOOL BUILDING

THE EFFECT OF PLATE GEOMETRY ON FLOW AND HEAT TRANSFER ACROSS A PARALLEL PLATE HEAT EXCHANGER

CHARACTERIZATION OF IRRADIATION MODIFICATION OF ETHYLENE VINYL ACETATE/ EPOXIDIZED NATURAL RUBBER/ HALLOYSITE NANOTUBES NANOCOMPOSITES

SOLUBILITY MODEL OF PALM OIL EXTRACTION FROM PALM FRUIT USING SUB-CRITICAL R134a NUR SYUHADA BINTI ABD RAHMAN

THE ABSOLUTE METHOD OF NEUTRON ACTIVATION ANALYSIS USING TRIGA NEUTRON REACTOR, NUCLEAR AGENCY, MALAYSIA LIEW HWI FEN UNIVERSITI TEKNOLOGI MALAYSIA

DEMULSIFICATION OF CRUDE OIL EMULSIONS VIA MICROWAVE ASSISTED CHEMICALS (ENVIRONMENTAL FRIENDLY) NUR HAFIZAH BINTI HUSSIN

UNIVERSITI TEKNOLOGI MARA MODELLING OF SURFACE AIR TEMPERATURE ELEMENTS: INTEGRATION OF MULTIPLE REGRESSION MODEL AND SPATIAL INTERPOLATION TECHNIQUE

ENGINEERING PROPERTIES OF OLDER ALLUVIUM BADEE ABDULQAWI HAMOOD ALSHAMERI. Universiti Teknologi Malaysia

Transcription:

STATISTICAL ANALYSIS OF WIND AND RAINFALL WITH FUNCTIONAL DATA ANALYSIS TECHNIQUE WAN NORLIYANA BT WAN ISMAIL A thesis submitted in fulfillment of the requirement for the award of the degree of Master of Science (Mathematics) Faculty of Science Universiti Teknologi Malaysia JANUARY 2014

To my beloved mother and father iii

iv ACKNOWLEDGEMENT By the name of Allah, this thesis is finally completed. It is an honor to express my fullest gratitude to the people who have been involved in finishing this thesis. In preparing this research, I have been discussing with my supervisor on the solution to overcome the problem in this thesis. I wish to express my fullest gratitude to my thesis supervisor, Dr. Shariffah Suhaila Syed Jamaludin for the guidance and encouragement that make this thesis finally completed. Without her support, this thesis might be not as appropriate as shown here. Not to be forgotten, to my beloved parents, Wan Ismail Wan Ali and Seri Sulong as well as my family, I felt so grateful having all of you in my life, thank you for your profound understanding, helping and endless support me for the whole time I spend in this university. Last but not least, highly thanks also to my friends for supporting me from the beginning of preparing this thesis until it have been completed. There are many motivations and tips that I get from those who support me and I really appreciate it.

v ABSTRACT The speed of wind and rainfall throughout Peninsular Malaysia varies from one region to another, depending on the direction of winds and rainfall that occur because of strong wind and the monsoons. Our data consist of daily mean wind and rainfall data from ten stations and covering 25 years period from 1985 to 2009. The purpose of this study is to convert the wind and rainfall data into a smooth curve by using Functional Data Analysis (FDA) method. The Fourier basis is used in this study in which the wind and rainfall data indicate periodic pattern for ten stations. In this method, to avoid such overfitting data, roughness penalty is added to the least square when constructing functional data object from the observed data. By using small basis functions, the difference is very small between with and without roughness penalty, showing that it is safer to smooth only when required. Meanwhile, with large basis functions and the difference of sum of square is very large, roughness penalty should be added in order to obtain optimal fit data. The graphs with contour plot show the relationship between wind and rainfall data, which illustrate the correlation and cross-correlations functions. Functional linear model also presents the relationship that may exist between wind (functional data) and rainfall (scalar response). Square multiple correlations give strong positive linear correlation, which concludes that the rainfall is influenced by the wind speed.

vi ABSTRAK Kelajuan angin dan hujan di seluruh Semenanjung Malaysia berbeza mengikut kedudukan geografi setiap negeri, iaitu bergantung kepada arah angin dan hujan yang berlaku pada musim tengkujuh. Data ini terdiri daripada purata kelajuan angin dan hujan mengikut harian selama 25 tahun dari tahun 1985 hingga tahun 2009 dan juga diperolehi daripada sepuluh stesen. Tujuan kajian ini dijalankan adalah untuk menukarkan data angin dan hujan kepada lengkungan licin dengan menggunakan kaedah Fungsi Analisis Data (FDA). Fourier asas digunakan dalam kajian ini, di mana data angin dan hujan menunjukkan corak berkala selama sepuluh stesen. Dalam kaedah ini, untuk mengelakkan data overfitting, penalti kekasaran ditambah kepada kuasa dua terkecil apabila membina fungsi objek daripada data yang diperhatikan. Dengan menggunakan fungsi asas yang kecil, perbezaan adalah kecil di antara penggunaan dan tanpa penggunaan penalti kekasaran, menunjukkan bahawa ia adalah lebih selamat untuk melicinkan lengkungan hanya apabila diperlukan. Sementara itu, dengan menggunakan fungsi asas yang besar dan perbezaan jumlah kuasa dua adalah sangat besar, penalti kekasaran perlu ditambah untuk mendapatkan data yang optimum. Kontur plot menunjukkan hubungan antara data angin dan hujan, menggambarkan fungsi korelasi dan korelasi silang. Fungsi model linear juga menunjukkan hubungan yang wujud antara angin (fungsi data) dan hujan (tindak balas skalar). Korelasi square multiple memberi korelasi linear positif yang kuat, menyimpulkan bahawa hujan adalah dipengaruhi oleh kelajuan angin.

vii TABLE OF CONTENTS CHAPTER TITLE PAGE DECLARATION DEDICATION ACKNOWLEDGEMENT ABSTRACT ABSTRAK TABLE OF CONTENTS LIST OF TABLES LIST OF FIGURES LIST OF APPENDIX ii iii iv v vi vii x xi xii 1 INTRODUCTION 1.1 Introduction 1 1.2 Research Background 2 1.3 Problem Statement 3 1.4 Research Objective 4 1.5 Scope of the Study 4 1.6 Significance of the Study 5

viii 2 LITERATURE REVIEW 2.1 Introduction 6 2.2 Overview of Functional Data Analysis 7 2.3 Applications of Functional Data Analysis Techniques 8 2.3.1 FDA in Demographics 9 2.3.2 FDA in Economy / Econometrics 10 2.3.3 FDA in Environments 11 2.3.4 FDA in Medicines / Medicals 14 3 RESEARCH METHODOLOGY 3.1 Introduction 16 3.2 Basis Function 17 3.2.1 Fourier Series Basis System 18 3.2.2 Smoothing Functional Data 19 3.2.3 Roughness Penalty Approach 20 3.3 Descriptions of Functional Data 21 3.3.1 Functional Means and Variances 21 3.3.2 Correlation and Cross-Correlation Functions 22 3.4 Functional Linear Models for Scalar Responses 23 3.4.1 Functional Linear Regression and Correlation 24 3.5 Operational Framework 26 4 RESULTS AND DISCUSSION 4.1 Introduction to the Dataset 27 4.2 Graphical Display of Mean Daily Wind Speed 28 4.3 Graphical Display of Mean Daily Rainfall 31

ix 4.4 Summary Statistics of Mean Daily Wind-Rainfall 34 4.5 Identifying The Number of Basis Functions 35 4.5.1 Smoothing Curves of Mean Daily Wind Speed in Peninsular Malaysia 38 4.5.2 Smoothing Curves of Mean Daily Rainfall Data in Peninsular Malaysia 40 4.6 Smoothing With and Without Roughness Penalty 43 4.6.1 Smoothing With and Without Roughness Penalty by using Small Basis Functions 43 4.6.2 Smoothing With and Without Roughness Penalty by using Large Basis Functions 47 4.7 Descriptions of Functional Data 51 4.7.1 Functional Means and Variances 52 4.7.2 Covariance and Correlation Functions 53 4.7.3 Cross-Covariance and Cross-Correlation Functions 54 4.8 Functional Linear Models for Scalar Responses 56 5 CONCLUSION AND RECOMMENDATION 5.1 Conclusion 60 5.2 Recommendation 62 6 REFERENCES 63

x LIST OF TABLES TABLE NUMBER TITLE PAGE 4.1 Descriptive Statistics of Daily Mean Wind-Rainfall for 10 Stations 34 4.2 Analysis Deviance of Daily Mean Wind Speed for Mersing 36 4.3 Analysis Deviance of Daily Mean Rainfall Data for Kuala Krai 36 4.4 Number of Basis Functions for Wind and Rainfall Data 37 4.5 Values of Lambda, Degrees of Freedom and GCV for Temerloh and Cameron Highlands (Small Basis Functions) 45 4.6 Sum of Square Residuals for Wind and Rainfall Data by Using Small Basis Functions 47 4.7 Values of Lambda, Degrees of Freedom and GCV for Temerloh and Cameron Highlands (Large Basis Functions) 48 4.8 The Values of Lambda, for Wind and Rainfall Data by Using Large Basis Functions 50 4.9 Sum of Square Residuals for Wind and Rainfall Data by Using Large Basis Functions 51 4.10 Estimated Values for the Coefficient Functions, 56 4.11 The Values of Residuals for Rainfall Responding Variables 58

xi LIST OF FIGURES FIGURE NUMBER TITLE PAGE 4.1 The Location of Ten Stations in Peninsular Malaysia 28 4.2 (a) to (j) Mean Daily Wind Speed for Ten Stations 29 4.3 (a) to (j) Mean Daily Rainfall Data for Ten Stations 31 4.4 (a) to (j) Smoothing Curve of Wind Speed for Ten Stations 38 4.5 (a) to (j) Smoothing Curve of Rainfall Data for Ten Stations 40 4.6 (a) & (b) Smoothing Curve without Roughness Penalty for Temerloh (Wind) and C. Highlands (Rainfall) 44 4.7 (a) & (b) The Degrees of Freedom and The Values of the GCV Criterion for Temerloh and C. Highlands (Small Basis Function) 45 4.8 (a) & (b) Smoothing Curve with and without Roughness Penalty for Temerloh (Wind) and C. Highlands (Rainfall) 46 4.9 (a) & (b) The Degrees of Freedom and The Values of the GCV Criterion for Temerloh and C. Highlands (Large Basis Function) 48 4.10 (a) & (b) Smoothing Curve with and without Roughness Penalty for Temerloh (Wind) and C. Highlands (Rainfall) 49 4.11 (a) & (b) The Mean and Standard Deviations for Ten Station of Wind and Rainfall Data 52

xii 4.12 (a) & (b) The Contour Plot of The Bivariate Correlation Function for Wind and Rainfall Data 53 4.13 The Contour Plot of The Cross-Correlation Function for Wind and Rainfall Data 55 4.14 Estimated from Mean Daily Wind using seven Basis Functions 57 4.15 Predicted and Observed Value of Log Annual Rainfall 58 LIST OF APPENDIX APPENDIX TITLE (1) Determine The Analysis Deviances of Wind Data for Ten Station (2) Determine The Analysis Deviances of Rainfall Data for Ten Stations (3) Determine The Plotfit of Smoothing Curves of Wind Data for Ten Stations (4) Determine The Plotfit of Smoothing Curves of Rainfall Data for Ten Stations (5) Define The Smoothing With Roughness Penalty by using Small Basis Functions (5.1) Temerloh Station for Wind Data (5.2) Cameron Highlands Station for Rainfall Data (6) Define The Smoothing With Roughness Penalty by using Large Basis Functions

xiii (6.1) Temerloh Station for Wind Data (6.2) Cameron Highlands Station for Rainfall Data (7) Determine the Descriptions of Functional Data (7.1) Functional Mean and Variance with Correlation Functions for Wind Data (7.2) Functional Mean and Variance with Correlation Functions for Rainfall Data (7.3) Cross-Covariance and Cross-Correlation Functions (8) Functional Linear Models with Scalar Responses (8.1) Functional Linear Regression and Correlation

CHAPTER 1 INTRODUCTION 1.1 Introduction Functional data analysis is a branch of statistics. Functional Data Analysis (FDA) develops fast in statistics area with the aim of estimating a set of related functions or curves rather than focusing on a single entity, like estimating a point, as in classical statistics. Some methods are simply the extension of existing techniques in conventional statistics while others need more than exchanging the summation, which is used in discrete observation, to an integration, which is a continuum. Data sets in FDA are in the form of curves, surfaces or anything else varying over a continuum compare to multivariate statistics, where data are considered as vectors (finite sets of values).

2 Recently, the application of statistical modeling to medicine, biomedicine, public health, biological sciences, biomechanics, environmental science geology, psychology, and economics has largely and increasingly being driven by the need for better data to assist in government policy making and planning processes for any services required. As a matter of fact, discrete data points collected over a continuum can be looked upon as a function or curve that is presumed to be a reasonably smooth to those discrete smoothed points. This continuum is not necessarily a physical time point, but rather attributes such as age, spatial location, seasons, or temperature. Importantly, such models will only be useful in the long term if they are accurate, based on good quality data, and generated through the application of robust appropriate statistical methods. 1.2 Research Background In conventional statistical practice, observation is usually a number or a vector. But in many real-life situations, observed values are continuous curves, vectors of curves, images, or vectors of images. The characteristics about functional data are the possibility of using information on the rates of change or derivatives of the curves. We use the slopes, curvatures, and other characteristics made available because these curves are intrinsically smooth, and can use this information in many useful ways. Not only that, FDA gives a strong link with the multivariate statistical paradigm and for the regularization. The stable link with multivariate statistics comes from the fact that methods, such as principal component analysis, multivariate linear modeling, functional linear model, functional ANOVA, canonical correlation analysis, etc. can be applied within the functional data analysis framework.

3 In this study, pattern characteristics of daily average wind-rainfall in Peninsular Malaysia for the period 1985 to 2009 are investigated for ten stations. In FDA, a set of observations is transformed into a functional object, and statistical analysis is then performed on this continuous function, rather than on the original discrete data points. These make it possible to extract information from the temporal process as a whole, instead of merely point-by-point. In order to describe the characteristics of wind-rainfall data, FDA can represent the data in the form of smooth functions or curves that give meaningful information. The wind in Malaysia is generally light and varies; however, some uniform periodic changes in the wind flow patterns can also influence rainfall distribution. A strong wind is expected to bring heavy rainfall at the location. We can say that when there is strong wind, this can influence the rainfall. In other words, the heavy rain can happen because of the wind speed is higher. 1.2 Problem Statement In this analysis, two climate variables that often change throughout the year are daily mean wind speed and rainfall. The seasonal wind flow patterns paired with the local topographic features determine the rainfall distribution patterns over the area. By using FDA, the discrete data are transformed into curves. If the discrete values are assumed to be errorless, the process is an interpolation. But if they have some observational errors that need to be removed, the transformation from discrete data to the functions may involve smoothing. We use FDA to interpret wind-rainfall data with certain smooth functions that are assumed to underlie and to generate the observations. FDA methods are not necessarily based on the assumption for a single subject but values observed at different times. For calculating the coefficients of measurement error and to

4 compute the coefficients to obtain an optimal fit to wind-rainfall data, powerful basis expansion is used. However to avoid overfitting the data, a penalty on the roughness of the function is imposed. To examine the relationship between wind and rainfall, having a functional linear model is suitable for continuous time process. 1.3 Research Objectives The objectives of the study are: To determine the number of basis functions for each weather station. To summarize the pattern of wind-rainfall data using the functional descriptive statistics. To establish the relationship between wind and rainfall data using the concept of functional linear model. 1.4 Scope of the Study This study focuses on profiling ten stations for wind and rainfall data throughout Peninsular Malaysia. Kuala Krai, Batu Embun, Temerloh, Muadzam Shah, Mersing, Senai, Bayan Lepas, Cameron Highlands, Ipoh and Subang are among the ten selected stations in this study. In this analysis, we used daily windrainfall data for the period of 25 years which from the year 1985 until 2009. Data were obtained from Malaysian Meteorological Service (MMS).

5 1.5 Significance of Study The result of this study will give the advantage which is focused on how the FDA techniques can be used for wind-rainfall data analysis. This study can determine the patterns of each region of wind-rainfall data in Peninsular Malaysia. The FDA techniques can represent both climate data in the form of smooth curves for each region located. Functional data analysis also will produce important basis functions for this research. In statistics, this research will broaden the application of functional data analysis in our daily life.

63 REFERENCES Clarkson, D., Fraley, C., Gu, C.C., Ramsay, J.O., 2005. S+ Functional Data Analysis. Springer, US of America. Cuevas, A., Febrero, M., Fraiman, R., 2003. An Anova Test for Functional Data. Comput. Stat. Data Anal. 47, 111-122. Febrero, M., Galeano, P., Manteiga, W.G., 2007. A Functional Analysis of NOx Levels: Location and Scale Estimation and Outlier Detection. Comput. Stat. 22, 411-427. Ferraty, F., Vieu, P., Viguier-Pla, S., 2006. Factor-based Comparison of Groups of Curve. Comput. Stat. Data Anal. 51, 4903-4910. Froslie, K.F., 2012. Shape Information from Glucose Curves: Functional Data Analysis Compared with Traditional Summary Measures. BMC. Med. Rese. Method. 13 (6), 1471-2288. Gao, H.O., Niemeier, D.A., 2008. Using Functional Data Analysis of Diurnal Ozone and NOx Cycles to Inform Transportation Emissions Control. Trans. Research. Part D. 13, 221-238. Guo, M., Zhou, L., Huang, J.Z., Hardle, W.K., 2012. Functional Data Analysis of Generalized Quantile Regressions. SBP. Discussion Paper, 001. Hyndman, R.J., Booth, H., 2008. Stochastic Population Forecasts using Functional Data Models for Mortality, Fertility and Migration. Int. J. Forecasting. 24, 323-342.

64 Laukaitis, A., Rackauskas, A., 2004. Functional Data Analysis for Clients Segmentation Tasks. Euro. J. Operational Research. 163, 210-216. Manteiga, W.G., Vieu, P., 2007. Statistical for Functional Data. Comput. Stat. Data Anal. 51, 4788-4792. Muller, H.G., Stadtmuller, U., 2005. Generalized Functional Linear Model. Annals Stat. 33(2), 774-805. Muniz, C.D., Nieto, P.J.G., Fernandez, J.R.A., Torres, J.M., Taboada, J., 2012. Detection of Outliers in Water Quality Monitoring Samples using Functional Data Analysis in San Esteban Estuary. SN. Total Envi. 439, 54-61. Newell, J., McMillan, K., Grant, S., McCabe, G., 2004. Using Functional Data Analysis to Summarise and Interpret Lactate Curves. Comput. Bio. Medi. 36, 262-275. Nikitovic, V., 2011. Functional Data analysis in Forecasting Serbian Fertility. Institute of Social Sciences. 2, 73-89. Ramsay, J.O., Ramsey, J.B., 2002. Functional Data Analysis of the Dynamics of the Monthly Index of Nondurable Goods Production. J. Econometrics. 107, 327-344. Ramsay, J.O., Hooker, G., Graves, S., 2009. Functional Data Analysis with R and Matlab. Springer, New York. Ramsay, J.O., Silverman, B.W., 2005. Functional Data Analysis, second ed.springer, New York. Ratcliffe, S.J., Leader, L.R., Heller, G.Z., 2002. Functional data analysis with application to periodically stimulated foetal heart rate data. I: Functional regression. John Wiley & Sons, Ltd. 21, 1103-1114.

65 Song, J.J., Deng, W., Lee, H.J., Kwon, D., 2008. Optimal Classification for Time- Course Gene Expression Data using Functional Data Analysis. Comput. Bio. Chemis. 32, 426-432. Suhaila, J., Jemain, A.A., Hamdan, M.F., Zin, W.Z.W., 2011. Comparing Rainfall Pattern between regions in Peninsular Malaysia via a Functional Data Analysis. J. Hydro. 411, 197-206. Tian, T.S., 2010. Functional Data Analysis in Brain Imaging Studies. Front Psychol. 1, 35. Torres, J.M., Nieto, P.J.G., Alejano, L., Reyes, A.N., 2010. Detection of Outliers in Gas Emissions from Urban Areas using Functional Data Analysis. J. Hazard. Material. 186, 144-149.