We may not be able to do any great thing, but if each of us will do something, however small it may be, a good deal will be accomplished.

Size: px
Start display at page:

Download "We may not be able to do any great thing, but if each of us will do something, however small it may be, a good deal will be accomplished."

Transcription

1 Chapter 4 Initial Experiments with Localization Methods We may not be able to do any great thing, but if each of us will do something, however small it may be, a good deal will be accomplished. D.L. Moody Bioinformatics use a wide range of computational methods to solve the mysteries of life. Computational subcellular localization prediction taps the potential of computer science for problem solving. In subcellular localization prediction, computational methods are applied for two main purposes, first to identify the biological features that can influence the subcellular localization and the second is to use the identified biological features to make the prediction with highest accuracy. This chapter reports a wide verity of experiments that was initially conducted for identification of biological features that impact subcellular localization. Though these altogether were inconclusive, they set the stage for the final investigations, which are reported in next two chapters. Mainly, K-means clustering, Shannon value and Hilbert Huang Transform were explored for achieving computational subcellular localization prediction. The main observation from each method are discussed in this chapter. 48

2 4.1 K-means Clustering K-means clustering is an unsupervised learning algorithm which partition data into k clusters in which each data belongs to the cluster with the closest mean. The procedure can be e:>"''plained in the following step. First define k centroids from the data, one for each cluster. The next step is the assignment step which associate each data item to the nearest centroid. This is followed by the update step, which recalculates k new centroids. Both the assignment and update steps are repeated till no change in centroids. The algorithm aims at minimizing an objective function, a squared error function. The objective function is (4.1) where Xl, X2, X3,... X n are n observation to be clustered into k clusters, Cj is the centroid of cluster j and Ilx~j) - Cjl12 is a chosen distance measure between a data point and the cluster centre. Application of K means Clustering in Subcellular Localization Prediction As part of this study, the amino acids were clustered based on different physiochemical parameters like hydrophobicity, charge etc. Ninety six physiochemical indices, downloaded from AAIndex Database were used for clustering. A list of these physiochemical parameters is given in the appendix. Clusters ranging from 2 to 19 were tried out as there are 20 amino acids. The idea behind applying clustering in the study originated from the observation that groups of similar (e.g. hydrophobic) amino acids defines the protein sorting signal present in the N-terminal of amino acid sequences. For example, in the case of signal peptides, which are the address signals of secretory proteins, has a regional structure with N-terminal domain that is positively charged, a hydrophobic central region (h-region; leucines are most common), and a cleavage site region with mostly small residues. In mtp sequences, which are the address signals of the mitochondria proteins, there is an over representation of Arg, Ala and Ser, while negatively charged residues (Asp, Glu) are rare [26]. The first step in applying clustering is the translation of 49

3 u>- ~ :> u -< II.. 0 ~ a EIIP Hydrophobicity D Molecular Weight a M composition Absolute Entropy a Average Non-bounded Energy a lsoelectrlc Point a Mean Polarity Size o CLUSTER Figure 4.1: Variation in prediction accuracies for different clusters of amino acids the original amino acid sequence into the clustered amino acid sequence. For this, a physiochemical parameter and a value for k was used for clustering the amino acids into k groups. The entire amino acid sequence is translated into a k-ietter string, where each amino acid is replaced by the representative alphabet of the cluster in which the amino acid is a member. These clustered amino acid sequences were used for many different investigations. For example, compositions of various clusters were explored using the covariant discriminant algorithm [134] proposed by K. C. Chou. A graph of prediction accuracies against various clusters using this method is given in Figure 4.1. The clustered amino acid sequences of proteins of each location were plotted for a visual representation and inspection for leads. The major observation from clustering is that, it has negative impact on protein subcellular localization prediction. In addition to K-means clustering, the widely accepted KooIman classification [135] is used to cluster the amino acids. The KooIman classification groups the amino acids based on its prominent characteristics as aliphatic, sulphur-containing, aromatic, neutral, acidic, basic and imino acid. The 50

4 amino acid composition of sequences, clustered based on KooIman classification were investigated for a better prediction accuracy. Support Vector Machines were used for this. The dipeptide, tripeptide amino acid composition of these sequences were also investigated. The optimized RBF kernels gave an accuracy of for tripeptide frequency and for dipeptide frequency. 4.2 Shannon Index The Shannon index, also known as Shannon-Wiener Index or the Shannon Weaver Index [136], is one of the diversity indices used to measure diversity in categorical data. It is widely used in measurement of biodiversity, by treating species as symbols and their relative population sizes as the probability. The index can be computed as s H' = -l:pi lnpi - [(8-1)/2N] i=l (4.2) where ni is the number ofindividuals in species i; S is the number ofspecies. N is the total number of all individuals. Pi is the relative abundance of each species and is calculated as the proportion of individuals of a given species to the total number of individuals in the community R Application of Shannon Index in Subcellular Localization Prediction Shannon index is selected for exploration because it is a measure of information entropy of the distribution. The order of amino acids with in the sequence (sequence order effect) was already proved to have impact on the prediction accuracy of subcellular localization [137]. So it would be good to explore the impact of entropy of distribution of amino acids on subcellular localization prediction. In the case of amino acid sequence, the amino acids were treated as symbols and their frequency as the probability. Shannon entropy for the full length sequences from various locations were calculated and plotted. But these shared a very close range, making the Shannon index 51

5 a poor parameter for classification based on protein locations. The Shannon index of different regions in the protein sequence like, N terminal, middle region and C-terminal were calculated and investigated. Support Vector machine were applied on this for making prediction. But the accuracy was only below 60 and not competitive to use for prediction. Both clustering and Shannon index were combined and investigated. The amino acids were clustered using different physiochemical parameters and clusters from 2 to 19. Shannon values of clustered sequence was calculated and used for subcellular localization prediction. The accuracy was poor for this method also. A sliding window of size 100 was used to study how Shannon index or the entropy of distribution changes over the sequence. This study also did not lead to a positive conclusion. 4.3 Hilbert Huang Transform In most of the physical applications, the data obtained will be both nonlinear and non stationary. Hilbert Huang Transform is an effective method to analyze this kind of data. The Hilbert-Huang transform (HHT), is proposed by Huang et al [138]. It has two parts, the empirical mode decomposition (EMD) and the Hilbert spectral analysis (HSA). The HHT uses the EMD method to decompose a signal into intrinsic mode function(imfs), and uses the Hilbert Spectral Analysis (HSA) method on these IMFs to obtain instantaneous frequency data. The EMD has been tested and validated exhaustively, but only empirically. An IMF is defined as a function that satisfies the two conditions (1) In the whole data set, the number of extrema and the number of zero-crossings must either be equal or differ at most by one. (2) At any point, the mean value of the upper envelope defined by the local maxima and the lower envelope defined by the local minima is zero. The IMFs are calculated as follows. First identify, the local maxima and mininma of the data series x(t) and connect them to form upper and lower envelops. Their mean ml is subtracted from the x(t) to get hi. ie x(t) - ml = hi' If hi doest not satisfy the condition for IMF, the steps are repeated, then hi - mll = h ll. This sifting procedure is repeated k times until hik is an IMF, that is hick-i) - mik = hik Hence the 52

6 first IMF is obtained. Then for finding the next IMF, the calculated IMF is subtracted from the x(t) and the entire procedure is repeated. The IMFs are related to the data as in the following equation. N x(t) = - 'L1MFi(t) + Tn(t) i=l (4.3) where Tn(t) is the left over from the final iteration, from which no more IMFs can be calculated. Application of Hilbert Huang Transform in Subcellular Localization Prediction Feng Shi, Qiu-Jian Chen and Na-na Li had earlier used HHT for subcellular localization prediction [139]. They had used hydrophobicity index to translate amino acid sequence into numeric sequence and used the energy value of IMFs for predicting the subcellular localization of Apoptosis proteins. In this work, for effectively utilising the Hilbert Huang Transform(HHT) in subcellular localization prediction, the amino acid composition of each protein was treated as the numerical sequences representing the protein. Thus a fixed length signal was generated from varying length amino acid sequence. The highest instantaneous amplitude from first four IMFs were selected and combined with amino acid composition. Support Vector Machine was applied over this data for subcellular localization prediction and an accuracy of was obtained. Even though this accuracy seems worth reporting, the support vector machines for amino acid composition alone gave better accuracy. This pointed out that the selected parameter reduced the prediction accuracy. Instead of highest instantaneous amplitude, many other parameters like position of highest instantaneous amplitude, highest instantaneous frequency and position of highest instantaneous frequency were also tried out. The HHT was even applied for amino acid composition of selective group of amino acids like acidic, aromatic, basic, ionizable, polar hydrophilic, nonpolar hydrophobic etc. The prediction accuracy was less than 75. The major observations from this work was that considering all twenty amino acids in 53

7 all its vividness gives better accuracy than selecting only a subgroup of them. This observation is in agreement with clustering, where the amino acid clustering reduces the subcellular localization prediction accuracy. 4.4 Digital Signal Processing Digital Signal Processing (DSP) is a popular method in engineering and is the basis of many areas of technology, from mobile phones to modems and multimedia PCs. DSP is the processing of signals by digital means. In DSP, the signals can be represented and studied in various domains like time domain, spatial domain, frequency domain, autocorrelation domain, and wavelet domains. The best representation is determined by the nature of information represented by the signal. Most of the signals, as observed from nature, is in time or spatial domain representation. A discrete Fourier transform produces the frequency domain information, that is the frequency spectrum. The cross-correlation of the signal with itself over varying intervals of time or space constitute the autocorrelation. DSP can be applied to the biosequences. The character strings of biosequences can be translated to numerical signals using various mapping techniques. Upon these numerical signals, all DSP techniques can be applied and investigated. DSP has been successfully used in bioinformatics for gene finding, exon identification etc. [ ]. Application of Digital Signal Processing in Subcellular Localization Prediction A remarkable merit of using digital signal processing technique in subcellular localization is that many existing tools in mathematics and engineering can be straightforwardly used. Method in DSP such as Discrete Fourier Transform [112], Lyapunov index, Bessel function, Chebyshev filter [113] and Butterworth filter [114] were applied as techniques to bring forth the sequence order effect. In this study, the amino acid sequences were mapped to numerical sequences, mainly using the ninety six physiochemical parameters and k-means 54

8 clustering. Rather than deriving parameters to represent sequence properties, like in the earlier works, the entire signals were studied to bring out any hidden property that lead to identification of address signals present in the protein. The investigation was done by applying filters to enhance the property differences between the adjacent amino acids. This was tested by different physiochemical properties such as hydrophobicity, size. Differential filter was applied to find the impact of property differences in the amino acids on the subcellular location of protein. The filtered sequences were plotted for visual representation and investigation. The Fast Fourier Transforms (FFT) were applied to convert these preprocessed sequence into frequency domain. The FFTs were also plotted and investigated. Correlation also was applied upon the signals to detect the presence of sorting signal. Clustered indicator sequences were produced and investigated. For this, sequences were clustered using anyone of the ninety six physiochemical parameter and anyone of the cluster k ranging from 2 to 19. For K clusters, K indicator sequences can be generated. The K indicator sequences were studied by many approaches like finding the combined FFT. Even though many protein sequences are plotted using different DSP methods to bring out any hidden property of the address signal present in the protein sequences, none of them were able to reveal any significant information. 4.5 Conclusion This chapter presented various initial studies and investigations done on the biosequences for subcellular localization prediction. Even though the results were inconclusive, these investigations paved way for the successful prediction methods described in the next two chapters. It may be noted that the reported research [ ,139] show successful application of some of the methods tried, in different ways. For example, using parameters derived out of FFT and HHT had been effectively used for prediction. The many observations from the studies described in this chapter, lead to many new ideas. For example the decreasing prediction accuracy in clustering lead to the observation that including almost all biological features that can be significant will increase the prediction accuracy. This kindled ideas for experiments with full length sequences and considering the sequences as combination of domains. The ne>.."t chapter describes how these experiments and atomic composition were effectively used in subcellular localization prediction. 55

Lecture Hilbert-Huang Transform. An examination of Fourier Analysis. Existing non-stationary data handling method

Lecture Hilbert-Huang Transform. An examination of Fourier Analysis. Existing non-stationary data handling method Lecture 12-13 Hilbert-Huang Transform Background: An examination of Fourier Analysis Existing non-stationary data handling method Instantaneous frequency Intrinsic mode functions(imf) Empirical mode decomposition(emd)

More information

Hilbert-Huang and Morlet wavelet transformation

Hilbert-Huang and Morlet wavelet transformation Hilbert-Huang and Morlet wavelet transformation Sonny Lion (sonny.lion@obspm.fr) LESIA, Observatoire de Paris The Hilbert-Huang Transform The main objective of this talk is to serve as a guide for understanding,

More information

Time Series Analysis using Hilbert-Huang Transform

Time Series Analysis using Hilbert-Huang Transform Time Series Analysis using Hilbert-Huang Transform [HHT] is one of the most important discoveries in the field of applied mathematics in NASA history. Presented by Nathan Taylor & Brent Davis Terminology

More information

Ultrasonic Thickness Inspection of Oil Pipeline Based on Marginal Spectrum. of Hilbert-Huang Transform

Ultrasonic Thickness Inspection of Oil Pipeline Based on Marginal Spectrum. of Hilbert-Huang Transform 17th World Conference on Nondestructive Testing, 25-28 Oct 2008, Shanghai, China Ultrasonic Thickness Inspection of Oil Pipeline Based on Marginal Spectrum of Hilbert-Huang Transform Yimei MAO, Peiwen

More information

ORGANIC - BROWN 8E CH AMINO ACIDS AND PROTEINS.

ORGANIC - BROWN 8E CH AMINO ACIDS AND PROTEINS. !! www.clutchprep.com CONCEPT: INTRODUCTION TO PROTEINS Proteins are polypeptides that have some biological function. Peptides are composed of polymers of monomeric units called α-amino acids The 20 most

More information

An Introduction to HILBERT-HUANG TRANSFORM and EMPIRICAL MODE DECOMPOSITION (HHT-EMD) Advanced Structural Dynamics (CE 20162)

An Introduction to HILBERT-HUANG TRANSFORM and EMPIRICAL MODE DECOMPOSITION (HHT-EMD) Advanced Structural Dynamics (CE 20162) An Introduction to HILBERT-HUANG TRANSFORM and EMPIRICAL MODE DECOMPOSITION (HHT-EMD) Advanced Structural Dynamics (CE 20162) M. Ahmadizadeh, PhD, PE O. Hemmati 1 Contents Scope and Goals Review on transformations

More information

Study of nonlinear phenomena in a tokamak plasma using a novel Hilbert transform technique

Study of nonlinear phenomena in a tokamak plasma using a novel Hilbert transform technique Study of nonlinear phenomena in a tokamak plasma using a novel Hilbert transform technique Daniel Raju, R. Jha and A. Sen Institute for Plasma Research, Bhat, Gandhinagar-382428, INDIA Abstract. A new

More information

Analysis of N-terminal Acetylation data with Kernel-Based Clustering

Analysis of N-terminal Acetylation data with Kernel-Based Clustering Analysis of N-terminal Acetylation data with Kernel-Based Clustering Ying Liu Department of Computational Biology, School of Medicine University of Pittsburgh yil43@pitt.edu 1 Introduction N-terminal acetylation

More information

Bearing fault diagnosis based on EMD-KPCA and ELM

Bearing fault diagnosis based on EMD-KPCA and ELM Bearing fault diagnosis based on EMD-KPCA and ELM Zihan Chen, Hang Yuan 2 School of Reliability and Systems Engineering, Beihang University, Beijing 9, China Science and Technology on Reliability & Environmental

More information

Enhanced Active Power Filter Control for Nonlinear Non- Stationary Reactive Power Compensation

Enhanced Active Power Filter Control for Nonlinear Non- Stationary Reactive Power Compensation Enhanced Active Power Filter Control for Nonlinear Non- Stationary Reactive Power Compensation Phen Chiak See, Vin Cent Tai, Marta Molinas, Kjetil Uhlen, and Olav Bjarte Fosso Norwegian University of Science

More information

Protein Bioinformatics. Rickard Sandberg Dept. of Cell and Molecular Biology Karolinska Institutet sandberg.cmb.ki.

Protein Bioinformatics. Rickard Sandberg Dept. of Cell and Molecular Biology Karolinska Institutet sandberg.cmb.ki. Protein Bioinformatics Rickard Sandberg Dept. of Cell and Molecular Biology Karolinska Institutet rickard.sandberg@ki.se sandberg.cmb.ki.se Outline Protein features motifs patterns profiles signals 2 Protein

More information

Mode Decomposition Analysis Applied to Study the Low-Frequency Embedded in the Vortex Shedding Process. Abstract

Mode Decomposition Analysis Applied to Study the Low-Frequency Embedded in the Vortex Shedding Process. Abstract Tainan,Taiwan,R.O.C., -3 December 3 Mode Decomposition Analysis Applied to Study the Low-Frequency Embedded in the Vortex Shedding Process Chin-Tsan Wang Department of Electrical Engineering Kau Yuan Institute

More information

AIR FORCE INSTITUTE OF TECHNOLOGY

AIR FORCE INSTITUTE OF TECHNOLOGY AN INQUIRY: EFFECTIVENESS OF THE COMPLEX EMPIRICAL MODE DECOMPOSITION METHOD, THE HILBERT-HUANG TRANSFORM, AND THE FAST-FOURIER TRANSFORM FOR ANALYSIS OF DYNAMIC OBJECTS THESIS Kristen L. Wallis, Second

More information

APPLICATIONS OF THE HUANG HILBERT TRANSFORMATION IN NON-INVASIVE RESEARCH OF THE SEABED IN THE SOUTHERN BALTIC SEA

APPLICATIONS OF THE HUANG HILBERT TRANSFORMATION IN NON-INVASIVE RESEARCH OF THE SEABED IN THE SOUTHERN BALTIC SEA Volume 17 HYDROACOUSTICS APPLICATIONS OF THE HUANG HILBERT TRANSFORMATION IN NON-INVASIVE RESEARCH OF THE SEABED IN THE SOUTHERN BALTIC SEA MIŁOSZ GRABOWSKI 1, JAROSŁAW TĘGOWSKI 2, JAROSŁAW NOWAK 3 1 Institute

More information

Engine fault feature extraction based on order tracking and VMD in transient conditions

Engine fault feature extraction based on order tracking and VMD in transient conditions Engine fault feature extraction based on order tracking and VMD in transient conditions Gang Ren 1, Jide Jia 2, Jian Mei 3, Xiangyu Jia 4, Jiajia Han 5 1, 4, 5 Fifth Cadet Brigade, Army Transportation

More information

Using modern time series analysis techniques to predict ENSO events from the SOI time series

Using modern time series analysis techniques to predict ENSO events from the SOI time series Nonlinear Processes in Geophysics () 9: 4 45 Nonlinear Processes in Geophysics c European Geophysical Society Using modern time series analysis techniques to predict ENSO events from the SOI time series

More information

Flow regime recognition in the spouted bed based on Hilbert-Huang transformation

Flow regime recognition in the spouted bed based on Hilbert-Huang transformation Korean J. Chem. Eng., 28(1), 308-313 (2011) DOI: 10.1007/s11814-010-0341-1 INVITED REVIEW PAPER Flow regime recognition in the spouted bed based on Hilbert-Huang transformation Wang Chunhua, Zhong Zhaoping,

More information

Approaches to the improvement of order tracking techniques for vibration based diagnostics in rotating machines

Approaches to the improvement of order tracking techniques for vibration based diagnostics in rotating machines Approaches to the improvement of order tracking techniques for vibration based diagnostics in rotating machines KeSheng Wang Supervisor : Prof. P.S. Heyns Department of Mechanical and Aeronautical Engineering

More information

EMD-BASED STOCHASTIC SUBSPACE IDENTIFICATION OF CIVIL ENGINEERING STRUCTURES UNDER OPERATIONAL CONDITIONS

EMD-BASED STOCHASTIC SUBSPACE IDENTIFICATION OF CIVIL ENGINEERING STRUCTURES UNDER OPERATIONAL CONDITIONS EMD-BASED STOCHASTIC SUBSPACE IDENTIFICATION OF CIVIL ENGINEERING STRUCTURES UNDER OPERATIONAL CONDITIONS Wei-Xin Ren, Department of Civil Engineering, Fuzhou University, P. R. China Dan-Jiang Yu Department

More information

A New Two-dimensional Empirical Mode Decomposition Based on Classical Empirical Mode Decomposition and Radon Transform

A New Two-dimensional Empirical Mode Decomposition Based on Classical Empirical Mode Decomposition and Radon Transform A New Two-dimensional Empirical Mode Decomposition Based on Classical Empirical Mode Decomposition and Radon Transform Zhihua Yang and Lihua Yang Abstract This paper presents a new twodimensional Empirical

More information

Physiochemical Properties of Residues

Physiochemical Properties of Residues Physiochemical Properties of Residues Various Sources C N Cα R Slide 1 Conformational Propensities Conformational Propensity is the frequency in which a residue adopts a given conformation (in a polypeptide)

More information

Motor imagery EEG discrimination using Hilbert-Huang Entropy.

Motor imagery EEG discrimination using Hilbert-Huang Entropy. Biomedical Research 017; 8 (): 77-733 ISSN 0970-938X www.biomedres.info Motor imagery EEG discrimination using Hilbert-Huang Entropy. Yuyi 1, Zhaoyun 1*, Li Surui, Shi Lijuan 1, Li Zhenxin 1, Dong Bingchao

More information

SUB-CELLULAR LOCALIZATION PREDICTION USING MACHINE LEARNING APPROACH

SUB-CELLULAR LOCALIZATION PREDICTION USING MACHINE LEARNING APPROACH SUB-CELLULAR LOCALIZATION PREDICTION USING MACHINE LEARNING APPROACH Ashutosh Kumar Singh 1, S S Sahu 2, Ankita Mishra 3 1,2,3 Birla Institute of Technology, Mesra, Ranchi Email: 1 ashutosh.4kumar.4singh@gmail.com,

More information

CAP 5510 Lecture 3 Protein Structures

CAP 5510 Lecture 3 Protein Structures CAP 5510 Lecture 3 Protein Structures Su-Shing Chen Bioinformatics CISE 8/19/2005 Su-Shing Chen, CISE 1 Protein Conformation 8/19/2005 Su-Shing Chen, CISE 2 Protein Conformational Structures Hydrophobicity

More information

Empirical Wavelet Transform

Empirical Wavelet Transform Jérôme Gilles Department of Mathematics, UCLA jegilles@math.ucla.edu Adaptive Data Analysis and Sparsity Workshop January 31th, 013 Outline Introduction - EMD 1D Empirical Wavelets Definition Experiments

More information

Neural Networks for Protein Structure Prediction Brown, JMB CS 466 Saurabh Sinha

Neural Networks for Protein Structure Prediction Brown, JMB CS 466 Saurabh Sinha Neural Networks for Protein Structure Prediction Brown, JMB 1999 CS 466 Saurabh Sinha Outline Goal is to predict secondary structure of a protein from its sequence Artificial Neural Network used for this

More information

SPECTRAL METHODS ASSESSMENT IN JOURNAL BEARING FAULT DETECTION APPLICATIONS

SPECTRAL METHODS ASSESSMENT IN JOURNAL BEARING FAULT DETECTION APPLICATIONS The 3 rd International Conference on DIAGNOSIS AND PREDICTION IN MECHANICAL ENGINEERING SYSTEMS DIPRE 12 SPECTRAL METHODS ASSESSMENT IN JOURNAL BEARING FAULT DETECTION APPLICATIONS 1) Ioannis TSIAFIS 1),

More information

BSc Project Fault Detection & Diagnosis in Control Valve

BSc Project Fault Detection & Diagnosis in Control Valve BSc Project Fault Detection & Diagnosis in Control Valve Supervisor: Dr. Nobakhti Content 2 What is fault? Why we detect fault in a control loop? What is Stiction? Comparing stiction with other faults

More information

Application of Hilbert-Huang signal processing to ultrasonic non-destructive testing of oil pipelines *

Application of Hilbert-Huang signal processing to ultrasonic non-destructive testing of oil pipelines * 13 Mao et al. / J Zhejiang Univ SCIENCE A 26 7(2):13-134 Journal of Zhejiang University SCIENCE A ISSN 19-395 http://www.zju.edu.cn/jzus E-mail: jzus@zju.edu.cn Application of Hilbert-Huang signal processing

More information

The prediction of membrane protein types with NPE

The prediction of membrane protein types with NPE The prediction of membrane protein types with NPE Lipeng Wang 1a), Zhanting Yuan 1, Xuhui Chen 1, and Zhifang Zhou 2 1 College of Electrical and Information Engineering Lanzhou University of Technology,

More information

HHT: the theory, implementation and application. Yetmen Wang AnCAD, Inc. 2008/5/24

HHT: the theory, implementation and application. Yetmen Wang AnCAD, Inc. 2008/5/24 HHT: the theory, implementation and application Yetmen Wang AnCAD, Inc. 2008/5/24 What is frequency? Frequency definition Fourier glass Instantaneous frequency Signal composition: trend, periodical, stochastic,

More information

Signal Period Analysis Based on Hilbert-Huang Transform and Its Application to Texture Analysis

Signal Period Analysis Based on Hilbert-Huang Transform and Its Application to Texture Analysis Signal Period Analysis Based on Hilbert-Huang Transform and Its Application to Texture Analysis Zhihua Yang 1, Dongxu Qi and Lihua Yang 3 1 School of Information Science and Technology Sun Yat-sen University,

More information

Fundamentals of the gravitational wave data analysis V

Fundamentals of the gravitational wave data analysis V Fundamentals of the gravitational wave data analysis V - Hilbert-Huang Transform - Ken-ichi Oohara Niigata University Introduction The Hilbert-Huang transform (HHT) l It is novel, adaptive approach to

More information

Supplemental Materials for. Structural Diversity of Protein Segments Follows a Power-law Distribution

Supplemental Materials for. Structural Diversity of Protein Segments Follows a Power-law Distribution Supplemental Materials for Structural Diversity of Protein Segments Follows a Power-law Distribution Yoshito SAWADA and Shinya HONDA* National Institute of Advanced Industrial Science and Technology (AIST),

More information

Clustering and Model Integration under the Wasserstein Metric. Jia Li Department of Statistics Penn State University

Clustering and Model Integration under the Wasserstein Metric. Jia Li Department of Statistics Penn State University Clustering and Model Integration under the Wasserstein Metric Jia Li Department of Statistics Penn State University Clustering Data represented by vectors or pairwise distances. Methods Top- down approaches

More information

The Empirical Mode Decomposition (EMD), a new tool for Potential Field Separation

The Empirical Mode Decomposition (EMD), a new tool for Potential Field Separation Journal of American Science 21;6(7) The Empirical Mode Decomposition (EMD), a new tool for Potential Field Separation S. Morris Cooper 1, Liu Tianyou 2, Innocent Ndoh Mbue 3 1, 2 Institude of Geophysucs

More information

Hilbert-Huang Transform versus Fourier based analysis for diffused ultrasonic waves structural health monitoring in polymer based composite materials

Hilbert-Huang Transform versus Fourier based analysis for diffused ultrasonic waves structural health monitoring in polymer based composite materials Proceedings of the Acoustics 212 Nantes Conference 23-27 April 212, Nantes, France Hilbert-Huang Transform versus Fourier based analysis for diffused ultrasonic waves structural health monitoring in polymer

More information

Amino Acids and Peptides

Amino Acids and Peptides Amino Acids Amino Acids and Peptides Amino acid a compound that contains both an amino group and a carboxyl group α-amino acid an amino acid in which the amino group is on the carbon adjacent to the carboxyl

More information

Protein structure. Protein structure. Amino acid residue. Cell communication channel. Bioinformatics Methods

Protein structure. Protein structure. Amino acid residue. Cell communication channel. Bioinformatics Methods Cell communication channel Bioinformatics Methods Iosif Vaisman Email: ivaisman@gmu.edu SEQUENCE STRUCTURE DNA Sequence Protein Sequence Protein Structure Protein structure ATGAAATTTGGAAACTTCCTTCTCACTTATCAGCCACCT...

More information

NMR Assignments using NMRView II: Sequential Assignments

NMR Assignments using NMRView II: Sequential Assignments NMR Assignments using NMRView II: Sequential Assignments DO THE FOLLOWING, IF YOU HAVE NOT ALREADY DONE SO: For Mac OS X, you should have a subdirectory nmrview. At UGA this is /Users/bcmb8190/nmrview.

More information

SENSITIVITY ANALYSIS OF ADAPTIVE MAGNITUDE SPECTRUM ALGORITHM IDENTIFIED MODAL FREQUENCIES OF REINFORCED CONCRETE FRAME STRUCTURES

SENSITIVITY ANALYSIS OF ADAPTIVE MAGNITUDE SPECTRUM ALGORITHM IDENTIFIED MODAL FREQUENCIES OF REINFORCED CONCRETE FRAME STRUCTURES SENSITIVITY ANALYSIS OF ADAPTIVE MAGNITUDE SPECTRUM ALGORITHM IDENTIFIED MODAL FREQUENCIES OF REINFORCED CONCRETE FRAME STRUCTURES K. C. G. Ong*, National University of Singapore, Singapore M. Maalej,

More information

Derivative-optimized Empirical Mode Decomposition for the Hilbert-Huang Transform

Derivative-optimized Empirical Mode Decomposition for the Hilbert-Huang Transform Derivative-optimized Empirical Mode Decomposition for the Hilbert-Huang Transform Peter C. Chu ), and Chenwu Fan ) Norden Huang 2) ) Naval Ocean Analysis and Prediction Laboratory, Department of Oceanography

More information

Statistical Movement Classification based on Hilbert-Huang Transform

Statistical Movement Classification based on Hilbert-Huang Transform University of Arkansas, Fayetteville ScholarWorks@UARK Theses and Dissertations 8-2017 Statistical Movement Classification based on Hilbert-Huang Transform Shanqi Sun University of Arkansas, Fayetteville

More information

The Study on the semg Signal Characteristics of Muscular Fatigue Based on the Hilbert-Huang Transform

The Study on the semg Signal Characteristics of Muscular Fatigue Based on the Hilbert-Huang Transform The Study on the semg Signal Characteristics of Muscular Fatigue Based on the Hilbert-Huang Transform Bo Peng 1,2, Xiaogang Jin 2,,YongMin 2, and Xianchuang Su 3 1 Ningbo Institute of Technology, Zhejiang

More information

Speech Based Gender Identification Using Empirical Mode Decomposition (EMD)

Speech Based Gender Identification Using Empirical Mode Decomposition (EMD) Speech Based Gender Identification Using Empirical Mode Decomposition (EMD) Mahpara Hyder Chowdhury Thesis Supervisor Dr. Hanif Bin Azhar DEPARTMENT OF COMPUTER SCIENCES & ENGINNERING SCHOOL OF COMPUTER

More information

Hilbert-Huang transform based physiological signals analysis for emotion recognition

Hilbert-Huang transform based physiological signals analysis for emotion recognition Hilbert-Huang transform based physiological signals analysis for emotion recognition Cong Zong and Mohamed Chetouani Université Pierre et Marie Curie Institut des Systèmes Intelligents et de Robotique,

More information

Empirical Mode Decomposition of Financial Data

Empirical Mode Decomposition of Financial Data International Mathematical Forum, 3, 28, no. 25, 1191-122 Empirical Mode Decomposition of Financial Data Konstantinos Drakakis 1 UCD CASL 2 University College Dublin Abstract We propose a novel method

More information

Multiscale Characterization of Bathymetric Images by Empirical Mode Decomposition

Multiscale Characterization of Bathymetric Images by Empirical Mode Decomposition Multiscale Characterization of Bathymetric Images by Empirical Mode Decomposition El-Hadji Diop & A.O. Boudraa, IRENav, Ecole Navale, Groupe ASM, Lanvéoc Poulmic, BP600, 29240 Brest-Armées, France A. Khenchaf

More information

Image Analysis Based on the Local Features of 2D Intrinsic Mode Functions

Image Analysis Based on the Local Features of 2D Intrinsic Mode Functions Image Analysis Based on the Local Features of 2D Intrinsic Mode Functions Dan Zhang Abstract The Hilbert-Huang transform (HHT) is a novel signal processing method which can efficiently handle nonstationary

More information

NWC Distinguished Lecture. Blind-source signal decomposition according to a general mathematical model

NWC Distinguished Lecture. Blind-source signal decomposition according to a general mathematical model NWC Distinguished Lecture Blind-source signal decomposition according to a general mathematical model Charles K. Chui* Hong Kong Baptist university and Stanford University Norbert Wiener Center, UMD February

More information

Derivative-Optimized Empirical Mode Decomposition (DEMD) for the Hilbert- Huang Transform

Derivative-Optimized Empirical Mode Decomposition (DEMD) for the Hilbert- Huang Transform Derivative-Optimized Empirical Mode Decomposition (DEMD) for the Hilbert- Huang Transform Peter C. Chu 1), Chenwu Fan 1), and Norden Huang 2) 1)Naval Postgraduate School Monterey, California, USA 2)National

More information

The use of the Hilbert transforra in EMG Analysis

The use of the Hilbert transforra in EMG Analysis The use of the Hilbert transforra in EMG Analysis Sean Tallier BSc, Dr P.Kyberd Oxford Orthopaedic Engineering Center Oxford University 1.1 INTRODUCTION The Fourier transform has traditionally been used

More information

Chap 1. Overview of Statistical Learning (HTF, , 2.9) Yongdai Kim Seoul National University

Chap 1. Overview of Statistical Learning (HTF, , 2.9) Yongdai Kim Seoul National University Chap 1. Overview of Statistical Learning (HTF, 2.1-2.6, 2.9) Yongdai Kim Seoul National University 0. Learning vs Statistical learning Learning procedure Construct a claim by observing data or using logics

More information

Digital Image Processing

Digital Image Processing Digital Image Processing, 2nd ed. Digital Image Processing Chapter 7 Wavelets and Multiresolution Processing Dr. Kai Shuang Department of Electronic Engineering China University of Petroleum shuangkai@cup.edu.cn

More information

Hilbert-Huang Transform-based Local Regions Descriptors

Hilbert-Huang Transform-based Local Regions Descriptors Hilbert-Huang Transform-based Local Regions Descriptors Dongfeng Han, Wenhui Li, Wu Guo Computer Science and Technology, Key Laboratory of Symbol Computation and Knowledge Engineering of the Ministry of

More information

Chemical Properties of Amino Acids

Chemical Properties of Amino Acids hemical Properties of Amino Acids Protein Function Make up about 15% of the cell and have many functions in the cell 1. atalysis: enzymes 2. Structure: muscle proteins 3. Movement: myosin, actin 4. Defense:

More information

Hilbert Transforms in Signal Processing

Hilbert Transforms in Signal Processing Hilbert Transforms in Signal Processing Stefan L. Hahn Artech House Boston London Contents Preface xiii Introduction 1 Chapter 1 Theory of the One-Dimensional Hilbert Transformation 3 1.1 The Concepts

More information

K-means-based Feature Learning for Protein Sequence Classification

K-means-based Feature Learning for Protein Sequence Classification K-means-based Feature Learning for Protein Sequence Classification Paul Melman and Usman W. Roshan Department of Computer Science, NJIT Newark, NJ, 07102, USA pm462@njit.edu, usman.w.roshan@njit.edu Abstract

More information

A. Two of the common amino acids are analyzed. Amino acid X and amino acid Y both have an isoionic point in the range of

A. Two of the common amino acids are analyzed. Amino acid X and amino acid Y both have an isoionic point in the range of Questions with Answers- Amino Acids & Peptides A. Two of the common amino acids are analyzed. Amino acid X and amino acid Y both have an isoionic point in the range of 5.0-6.5 (Questions 1-4) 1. Which

More information

Housekeeping. Housekeeping. Molecules of Life: Biopolymers

Housekeeping. Housekeeping. Molecules of Life: Biopolymers Molecules of Life: Biopolymers Dr. Dale Hancock D.Hancock@mmb.usyd.edu.au Room 377 Biochemistry building Housekeeping Answers to the practise calculations and a narration are on WebT. Access these through

More information

WRITER IDENTIFICATION BASED ON OFFLINE HANDWRITTEN DOCUMENT IN KANNADA LANGUAGE

WRITER IDENTIFICATION BASED ON OFFLINE HANDWRITTEN DOCUMENT IN KANNADA LANGUAGE CHAPTER 5 WRITER IDENTIFICATION BASED ON OFFLINE HANDWRITTEN DOCUMENT IN KANNADA LANGUAGE 5.1 Introduction In earlier days it was a notion of the people who were using computers for their work have to

More information

Atomic Hemoglobin Code

Atomic Hemoglobin Code P a g e 64 Vol.10 Issue2 (Ver 1.0) October 2010 Abstract-This paper discusses cyberinformation studies of the amino acid composition of Hemoglobin protein sequences Q6WN27, in particular the identification

More information

Radial Basis Function Neural Networks in Protein Sequence Classification ABSTRACT

Radial Basis Function Neural Networks in Protein Sequence Classification ABSTRACT (): 195-04 (008) Radial Basis Function Neural Networks in Protein Sequence Classification Zarita Zainuddin and Maragatham Kumar School of Mathematical Sciences, University Science Malaysia, 11800 USM Pulau

More information

AN ALTERNATIVE ALGORITHM FOR EMPIRICAL MODE DECOMPOSITION. 1. Introduction

AN ALTERNATIVE ALGORITHM FOR EMPIRICAL MODE DECOMPOSITION. 1. Introduction AN ALTERNATIVE ALGORITHM FOR EMPIRICAL MODE DECOMPOSITION LUAN LIN, YANG WANG, AND HAOMIN ZHOU Abstract. The empirical mode decomposition (EMD) was a method pioneered by Huang et al [8] as an alternative

More information

Introduction to Machine Learning

Introduction to Machine Learning Introduction to Machine Learning CS4731 Dr. Mihail Fall 2017 Slide content based on books by Bishop and Barber. https://www.microsoft.com/en-us/research/people/cmbishop/ http://web4.cs.ucl.ac.uk/staff/d.barber/pmwiki/pmwiki.php?n=brml.homepage

More information

Chaos, Complexity, and Inference (36-462)

Chaos, Complexity, and Inference (36-462) Chaos, Complexity, and Inference (36-462) Lecture 5: Symbolic Dynamics; Making Discrete Stochastic Processes from Continuous Deterministic Dynamics Cosma Shalizi 27 January 2009 Symbolic dynamics Reducing

More information

Invariant Pattern Recognition using Dual-tree Complex Wavelets and Fourier Features

Invariant Pattern Recognition using Dual-tree Complex Wavelets and Fourier Features Invariant Pattern Recognition using Dual-tree Complex Wavelets and Fourier Features G. Y. Chen and B. Kégl Department of Computer Science and Operations Research, University of Montreal, CP 6128 succ.

More information

ON EMPIRICAL MODE DECOMPOSITION AND ITS ALGORITHMS

ON EMPIRICAL MODE DECOMPOSITION AND ITS ALGORITHMS ON EMPIRICAL MODE DECOMPOSITION AND ITS ALGORITHMS Gabriel Rilling, Patrick Flandrin and Paulo Gonçalvès Laboratoire de Physique (UMR CNRS 5672), École Normale Supérieure de Lyon 46, allée d Italie 69364

More information

Identification of Milling Status Using Vibration Feature Extraction Techniques and Support Vector Machine Classifier

Identification of Milling Status Using Vibration Feature Extraction Techniques and Support Vector Machine Classifier inventions Article Identification of Milling Status Using Vibration Feature Extraction Techniques and Support Vector Machine Classifier Che-Yuan Chang and Tian-Yau Wu * ID Department of Mechanical Engineering,

More information

Bearing fault diagnosis based on Shannon entropy and wavelet package decomposition

Bearing fault diagnosis based on Shannon entropy and wavelet package decomposition Bearing fault diagnosis based on Shannon entropy and wavelet package decomposition Hong Mei Liu 1, Chen Lu, Ji Chang Zhang 3 School of Reliability and Systems Engineering, Beihang University, Beijing 1191,

More information

Neural Networks. Nethra Sambamoorthi, Ph.D. Jan CRMportals Inc., Nethra Sambamoorthi, Ph.D. Phone:

Neural Networks. Nethra Sambamoorthi, Ph.D. Jan CRMportals Inc., Nethra Sambamoorthi, Ph.D. Phone: Neural Networks Nethra Sambamoorthi, Ph.D Jan 2003 CRMportals Inc., Nethra Sambamoorthi, Ph.D Phone: 732-972-8969 Nethra@crmportals.com What? Saying it Again in Different ways Artificial neural network

More information

Math 1314 Lesson 24 Maxima and Minima of Functions of Several Variables

Math 1314 Lesson 24 Maxima and Minima of Functions of Several Variables Math 1314 Lesson 24 Maxima and Minima of Functions of Several Variables We learned to find the maxima and minima of a function of a single variable earlier in the course We had a second derivative test

More information

6.047 / Computational Biology: Genomes, Networks, Evolution Fall 2008

6.047 / Computational Biology: Genomes, Networks, Evolution Fall 2008 MIT OpenCourseWare http://ocw.mit.edu 6.047 / 6.878 Computational Biology: Genomes, Networks, Evolution Fall 2008 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms.

More information

Invariant Scattering Convolution Networks

Invariant Scattering Convolution Networks Invariant Scattering Convolution Networks Joan Bruna and Stephane Mallat Submitted to PAMI, Feb. 2012 Presented by Bo Chen Other important related papers: [1] S. Mallat, A Theory for Multiresolution Signal

More information

LINOEP vectors, spiral of Theodorus, and nonlinear time-invariant system models of mode decomposition

LINOEP vectors, spiral of Theodorus, and nonlinear time-invariant system models of mode decomposition LINOEP vectors, spiral of Theodorus, and nonlinear time-invariant system models of mode decomposition Pushpendra Singh 1,2, 1 Department of EE, Indian Institute of Technology Delhi, India 2 Jaypee Institute

More information

Algorithm of Insulin Amino acid Gln Lutvo Kurić Bosnia and Herzegovina, Novi Travnik, Kalinska 7

Algorithm of Insulin Amino acid Gln Lutvo Kurić Bosnia and Herzegovina, Novi Travnik, Kalinska 7 Algorithm of Insulin Amino acid Gln Lutvo Kurić Bosnia and Herzegovina, Novi Travnik, Kalinska 7 Abstract: This paper discusses cyberinformation studies of the amino acid composition of insulin, amino

More information

Bioinformatics Chapter 1. Introduction

Bioinformatics Chapter 1. Introduction Bioinformatics Chapter 1. Introduction Outline! Biological Data in Digital Symbol Sequences! Genomes Diversity, Size, and Structure! Proteins and Proteomes! On the Information Content of Biological Sequences!

More information

Classification of handwritten digits using supervised locally linear embedding algorithm and support vector machine

Classification of handwritten digits using supervised locally linear embedding algorithm and support vector machine Classification of handwritten digits using supervised locally linear embedding algorithm and support vector machine Olga Kouropteva, Oleg Okun, Matti Pietikäinen Machine Vision Group, Infotech Oulu and

More information

Prediction of unknown deep foundation lengths using the Hilbert Haung Transform (HHT)

Prediction of unknown deep foundation lengths using the Hilbert Haung Transform (HHT) Prediction of unknown deep foundation lengths using the Hilbert Haung Transform (HHT) Ahmed T. M. Farid Associate Professor, Housing and Building National Research Center, Cairo, Egypt ABSTRACT: Prediction

More information

ANALYSIS OF TEMPORAL VARIATIONS IN TURBIDITY FOR A COASTAL AREA USING THE HILBERT-HUANG-TRANSFORM

ANALYSIS OF TEMPORAL VARIATIONS IN TURBIDITY FOR A COASTAL AREA USING THE HILBERT-HUANG-TRANSFORM ANALYSIS OF TEMPORAL VARIATIONS IN TURBIDITY FOR A COASTAL AREA USING THE HILBERT-HUANG-TRANSFORM Shigeru Kato, Magnus Larson 2, Takumi Okabe 3 and Shin-ichi Aoki 4 Turbidity data obtained by field observations

More information

Lecture 11: Unsupervised Machine Learning

Lecture 11: Unsupervised Machine Learning CSE517A Machine Learning Spring 2018 Lecture 11: Unsupervised Machine Learning Instructor: Marion Neumann Scribe: Jingyu Xin Reading: fcml Ch6 (Intro), 6.2 (k-means), 6.3 (Mixture Models); [optional]:

More information

Prognostics and Diagnostics of Rotorcraft Bearings

Prognostics and Diagnostics of Rotorcraft Bearings Prognostics and Diagnostics of Rotorcraft Bearings M. HAILE, A. GHOSHAL and D. LE ABSTRACT 1 This paper presents a diagnostic and prognostic approach for rotorcraft bearing health monitoring using the

More information

Machine Learning for Data Science (CS4786) Lecture 8

Machine Learning for Data Science (CS4786) Lecture 8 Machine Learning for Data Science (CS4786) Lecture 8 Clustering Course Webpage : http://www.cs.cornell.edu/courses/cs4786/2016fa/ Announcement Those of you who submitted HW1 and are still on waitlist email

More information

Applied Machine Learning for Biomedical Engineering. Enrico Grisan

Applied Machine Learning for Biomedical Engineering. Enrico Grisan Applied Machine Learning for Biomedical Engineering Enrico Grisan enrico.grisan@dei.unipd.it Data representation To find a representation that approximates elements of a signal class with a linear combination

More information

Machine Learning. CUNY Graduate Center, Spring Lectures 11-12: Unsupervised Learning 1. Professor Liang Huang.

Machine Learning. CUNY Graduate Center, Spring Lectures 11-12: Unsupervised Learning 1. Professor Liang Huang. Machine Learning CUNY Graduate Center, Spring 2013 Lectures 11-12: Unsupervised Learning 1 (Clustering: k-means, EM, mixture models) Professor Liang Huang huang@cs.qc.cuny.edu http://acl.cs.qc.edu/~lhuang/teaching/machine-learning

More information

Supplementary Figure 3 a. Structural comparison between the two determined structures for the IL 23:MA12 complex. The overall RMSD between the two

Supplementary Figure 3 a. Structural comparison between the two determined structures for the IL 23:MA12 complex. The overall RMSD between the two Supplementary Figure 1. Biopanningg and clone enrichment of Alphabody binders against human IL 23. Positive clones in i phage ELISA with optical density (OD) 3 times higher than background are shown for

More information

Iterative filtering decomposition based on local spectral evolution kernel

Iterative filtering decomposition based on local spectral evolution kernel Iterative filtering decomposition based on local spectral evolution kernel Yang Wang 1, Guo Wei Wei 1, and Siyang Yang 1 1 Department of Mathematics Michigan State University, MI 4884, USA Department of

More information

REVIEW OF SINGLE CHANNEL SOURCE SEPARATION TECHNIQUES

REVIEW OF SINGLE CHANNEL SOURCE SEPARATION TECHNIQUES REVIEW OF SINGLE CHANNEL SOURCE SEPARATION TECHNIQUES Kedar Patki University of Rochester Dept. of Electrical and Computer Engineering kedar.patki@rochester.edu ABSTRACT The paper reviews the problem of

More information

Machine learning comes from Bayesian decision theory in statistics. There we want to minimize the expected value of the loss function.

Machine learning comes from Bayesian decision theory in statistics. There we want to minimize the expected value of the loss function. Bayesian learning: Machine learning comes from Bayesian decision theory in statistics. There we want to minimize the expected value of the loss function. Let y be the true label and y be the predicted

More information

Signal Modeling Techniques in Speech Recognition. Hassan A. Kingravi

Signal Modeling Techniques in Speech Recognition. Hassan A. Kingravi Signal Modeling Techniques in Speech Recognition Hassan A. Kingravi Outline Introduction Spectral Shaping Spectral Analysis Parameter Transforms Statistical Modeling Discussion Conclusions 1: Introduction

More information

Last updated: Copyright

Last updated: Copyright Last updated: 2012-08-20 Copyright 2004-2012 plabel (v2.4) User s Manual by Bioinformatics Group, Institute of Computing Technology, Chinese Academy of Sciences Tel: 86-10-62601016 Email: zhangkun01@ict.ac.cn,

More information

MULTISCALE CHANGES AND A TREND ANALYSIS OF THE FLUCTUATION OF HOUSEHOLD CONSUMPTION GROWTH IN CHINA. GUO Yue-ting 1, XU Jian-gang 2

MULTISCALE CHANGES AND A TREND ANALYSIS OF THE FLUCTUATION OF HOUSEHOLD CONSUMPTION GROWTH IN CHINA. GUO Yue-ting 1, XU Jian-gang 2 DOI:1.13959/j.issn.13-2398.211.5.13 211 5 121 :13-2398(211)5-9-5 1 2, (1., 2193;2., 2193) MULTISCALE CHANGES AND A TREND ANALYSIS OF THE FLUCTUATION OF HOUSEHOLD CONSUMPTION GROWTH IN CHINA GUO Yue-ting

More information

ECE 521. Lecture 11 (not on midterm material) 13 February K-means clustering, Dimensionality reduction

ECE 521. Lecture 11 (not on midterm material) 13 February K-means clustering, Dimensionality reduction ECE 521 Lecture 11 (not on midterm material) 13 February 2017 K-means clustering, Dimensionality reduction With thanks to Ruslan Salakhutdinov for an earlier version of the slides Overview K-means clustering

More information

K-means. Machine Learning 10701/15781 Carlos Guestrin Carnegie Mellon University. November 19 th, Carlos Guestrin 1

K-means. Machine Learning 10701/15781 Carlos Guestrin Carnegie Mellon University. November 19 th, Carlos Guestrin 1 EM Machine Learning 10701/15781 Carlos Guestrin Carnegie Mellon University November 19 th, 2007 2005-2007 Carlos Guestrin 1 K-means 1. Ask user how many clusters they d like. e.g. k=5 2. Randomly guess

More information

Prediction of the subcellular location of apoptosis proteins based on approximate entropy

Prediction of the subcellular location of apoptosis proteins based on approximate entropy Prediction of the subcellular location of apoptosis proteins based on approximate entropy Chaohong Song Feng Shi* Corresponding author Xuan Ma College of Science, Huazhong Agricultural University, Wuhan,

More information

ECE 533 Final Project. channel flow

ECE 533 Final Project. channel flow ECE 533 Final Project Decomposing non-stationary turbulent velocity in open channel flow Ying-Tien Lin 25.2.2 Decomposing non-stationary turbulent velocity in open channel flow Ying-Tien Lin Introduction

More information

EXAM 1 Fall 2009 BCHS3304, SECTION # 21734, GENERAL BIOCHEMISTRY I Dr. Glen B Legge

EXAM 1 Fall 2009 BCHS3304, SECTION # 21734, GENERAL BIOCHEMISTRY I Dr. Glen B Legge EXAM 1 Fall 2009 BCHS3304, SECTION # 21734, GENERAL BIOCHEMISTRY I 2009 Dr. Glen B Legge This is a Scantron exam. All answers should be transferred to the Scantron sheet using a #2 pencil. Write and bubble

More information

Time-frequency analysis of seismic data using synchrosqueezing wavelet transform a

Time-frequency analysis of seismic data using synchrosqueezing wavelet transform a Time-frequency analysis of seismic data using synchrosqueezing wavelet transform a a Published in Journal of Seismic Exploration, 23, no. 4, 303-312, (2014) Yangkang Chen, Tingting Liu, Xiaohong Chen,

More information

Prediction and Classif ication of Human G-protein Coupled Receptors Based on Support Vector Machines

Prediction and Classif ication of Human G-protein Coupled Receptors Based on Support Vector Machines Article Prediction and Classif ication of Human G-protein Coupled Receptors Based on Support Vector Machines Yun-Fei Wang, Huan Chen, and Yan-Hong Zhou* Hubei Bioinformatics and Molecular Imaging Key Laboratory,

More information

CS 188: Artificial Intelligence Fall 2011

CS 188: Artificial Intelligence Fall 2011 CS 188: Artificial Intelligence Fall 2011 Lecture 20: HMMs / Speech / ML 11/8/2011 Dan Klein UC Berkeley Today HMMs Demo bonanza! Most likely explanation queries Speech recognition A massive HMM! Details

More information