Generalization of the Principal Components Analysis to Histogram Data
|
|
- Erik Ward
- 5 years ago
- Views:
Transcription
1 Generalization of the Principal Components Analysis to Histogram Data Oldemar Rodríguez 1, Edwin Diday 1, and Suzanne Winsberg 2 1 University Paris 9 Dauphine, Ceremade Pl Du Ml de L de Tassigny orodrigu@ceremadedauphinefr diday@ceremadedauphinefr 2 IRCAM, 1 Place Igor Stravinsky F 75004, Paris, FRANCE SuzanneWinsberg@ircamfr Abstract In this article we propose an algorithm for Principal Components Analysis when the variables are histogram type This algorithm also works if the data table has variables of interval type and histogram type mixed If all the variables are interval type it produces the same output as the one produced by the algorithm of the Centers Method propose in [5, Cazes, Chouakria, Diday and Schektman (1997)] 1 The algorithm In this algorithm we use the idea proposed in [9, Diday (1998)] We represent each histogram individual by a succession of k interval individuals (the first one included in the second one, the second one included in the third one and so on) where k is the maximum number of modalities taken by some variable in the input symbolic data table Instead of representing the histograms in the factorial plane, we are going to represent the Empirical Distribution Function F Y defined, in [3, Bock and Diday (2000)] associated with each histogram In other words if we have an histogram variable Y on a set E = {a 1, a 2, } of objects with domain Y represented by the mapping Y (a) = (U(a), π a ), for a E, where π a is frequency distribution, then in the algorithm we will use the function F (x) = i / π i x π i instead of the histogram Definition 1 Let X = (x ij ) i=1,2,,m, j=1,2,,n be a symbolic data table with variables type continuous, interval and histogram, and let be k = max{s, where s is the number of modalities of Y j, j = 1, 2,, n} where Y j is a variable of histogram type 1 We define the vector succession of intervals associated with each cell of X as: 1 If all the variables are interval type then k = 1
2 1 if x ij = [a, b] then the vector succession of intervals associated is: [a, b] x ij = [a, b] [a, b] 2 If x ij = (1(p 1 ), 2(p 2 ),, s(p s )) with s k (histogram) then the vector succession of intervals associated is: [0, p 1 ] x ij = [0, p 1 + p 2 ] [0, s w=1 p w] k 1 k 1 3 If x ij = a then the vector succession of intervals associated is: [a, a] x ij = [a, a] [a, a] Definition 2 Let X = (x ij ) i=1,2,,m, j=1,2,,n be a symbolic data table with variables type continuous, interval and histogram We define the matrix X = (x ij ) for i = 1, 2,, m and j = 1, 2,, n It is important to note that X has m k rows 2 and n columns [ ] (1(01), 2(04), 3(05)) (1(02), 2(03), 3(05)) Example 1 If X = then (1(07), 2(02), 3(01)) (1(08), 2(01), 3(01)) [00000, 01000] [00000, 02000] [00000, 05000] [00000, 05000] X = [00000, 10000] [00000, 10000] [00000, 07000] [00000, 08000] [00000, 09000] [00000, 09000] [00000, 10000] [00000, 10000] The idea is to apply the algorithm 3 proposed in [13, Rodríguez (2000)] to the matrix X With this Principal Components Analysis we can find the shape of the individual histogram in the principal plane However because all the individual histogram will be projected almost in the same position around the origin So we have to apply another principal components analysis in order to find a good cluster structure to the individual histogram Therefore we will apply a classical principal components analysis to the matrix presented in the followings definitions 2 k like in the previous definition k 1
3 Definition 3 Let X = (x ij ) i=1,2,,m, j=1,2,,n be a symbolic data table with variables type continuous, interval and histogram We define the row vector associated with each cell of X as: 1 If x ij = [a, b] then the row vector associated is: [ ] a + b x ij = If x ij = (1(p 1 ), 2(p 2 ),, s(p s )) where s is number of modalities of the j th variable, then the row vector associated is: x ij = [p 1, p 2,, p s ] 1 s 3 If x ij = a then the row vector associated is: x ij = [a] 1 1 Definition 4 Let X = (x ij ) i=1,2,,m, j=1,2,,n be a symbolic data table with variables type continuos, interval and histogram We define the matrix X = (x ij ) of m rows and p = n j=1 s j columns, where number of modalities of the variable If the variable j is histogram type, s j = 1 If the variable j is interval type, 1 If the variable j is continue type [ ] (1(01), 2(04), 3(05)) (1(02), 2(03), 3(05)) Example 2 If X = then (1(07), 2(02), 3(01)) (1(08), 2(01), 3(01)) [ ] X = The idea of the algorithm is to apply a principal components analysis to the matrix X to find the shape of the individual histogram, and then to apply another principal components analysis to the matrix X Using this last principal components, we will translate the individual histogram to find the cluster structure of individual histogram in the principal plane Algorithm 1: Histogram Principal Components Analysis Input : m =number of symbolic objects n =number of symbolic variables The symbolic data table X = x 11 x 12 x 1n x 21 x 22 x 2n x m1 x m2 x mn
4 Output : The symbolic matrix with the first q principal components: y 11 y 12 y 1q y 21 Y = y 22 y 2q, y m1 y m2 y mq where (k like in definition 1): [ ] yij 1, y1 ij [ ] y ij = y 2 ij, y2 ij [ ] yij k, yk ij Step 1: Compute the matrix X of the definition 2 Step 2: Apply the algorithm 3 proposed in [13, Rodríguez (2000)] taking as input X It will produce the matrix: ŷ 11 ŷ 12 ŷ 1q 1 ŷ Ŷ 21 = ŷ 22 ŷ 2q 1, ŷ m1 ŷ m2 ŷ mq 1 where (k like in definition 1): [ ] ŷij 1, ŷ1 ij [ ] ŷ ij = ŷ 2 ij, ŷ2 ij [ ] ŷij k, ŷk ij for i = 1, 2,, n and j = 1, 2,, q 1 with q 1 n Step 3: Compute the matrix X of the definition 4 Step 4: Apply a classical principal components analysis to the matrix X It will produce the matrix: Ỹ = ỹ 11 ỹ 21 ỹ 12 ỹ 1q2 ỹ 22 ỹ 2q2 ỹ m1 ỹ m2 ỹ mq2 where q 2 p = n j=1 s j (s j like in definition 4):,
5 Step 5: q = min(q 1, q 2 ) Step 6: Compute the first q principal components: using the translation: Step 7: End of the algorithm y 11 y 12 y 1q y 21 Y = y 22 y 2q, y m1 y m2 y mq [ ] [ ] yij 1, y1 ij ŷij 1 [ ] + ỹ ij, ŷ1 ij + ỹ ij [ ] y ij = yij 2, y2 ij ŷij 2 = + ỹ ij, ŷ2 ij + ỹ ij [ ] [ ] yij k, yk ij ŷij k + ỹ ij, ŷk ij + ỹ ij 2 Examples To illustrate how the algorithm works in this section we present two examples Example 3 In this example we present the execution of the algorithm 1 with the symbolic data table presented in (1) This matrix has five variables, the first one is interval type, the second one is a variable quantitative discrete, and the last three variables are histogram type (the values are truncated) X = [1, 4] 2 (1(04), 2(01), 3(02), 4(007), 5(002)) (1(01), 2(09)) (1(07), 2(02)) [1, 4] 3 (1(06), 2(01), 3(01), 5(00)) (1(01), 2(09)) (1(07), 2(02)) [1, 5] 2 (1(07), 2(02)) (1(00), 2(09)) (1(07), 2(02)) [1, 4] 1 (1(07), 2(00), 3(01), 4(00), 5(00), 6(00), 7(00)) (1(00), 2(09)) (1(07), 2(02)) [1, 4] 1 (1(04), 3(04), 4(00), 5(00)) (1(00), 2(09)) (1(08), 2(01)) [1, 6] 2 (2(04), 3(01), 4(03), 5(00), 6(00), 7(00)) (1(00), 2(09)) (1(07), 2(02)) (1) Applying the algorithm 1 proposed above, we get the principal plane of Figure 1 If we plot the pyramid (see Figure 2) associated with the matrix (1) we get the same cluster structure as the one obteined it in the principal plane of Figure 1 The individuals East midlands non-metropolitan and Northern Ireland are isolated and the individuals North non-metropolitan, Yorks and Humberside metropoli, Yorks and Humberside non-metro and East midlands non-metropolitan are grouped
6 Fig 1 Principal plane with data of continuous, interval and histogram type Fig 2 Pyramid with data of continuous, interval and histogram type 3 The interpretation To explain how to interpret the Histogram Principal Components Analysis we will use one small example The interpretation of the position of the histogram individual in the principal plane is the same as in the classical principal components analysis situation We shall explain the interpretation of the succession of rectangles that represents each individual
7 Example 4 Let be VAR-1 VAR-2 X = IND-1 (1(01), 2(04), 3(05)) (1(02), 2(03), 3(05)) IND-2 (1(07), 2(02), 3(01)) (1(08), 2(01), 3(01)) This matrix can be also represent like we show in the Figure 3 Fig 3 Data table with two individus and two histogram variables If we apply the Histogram Principal Components Analysis to the previous data table we get the principal plane that we show in the Figure 4 The smallest rectangle of the projection of the individual 1(Ind1) represents the probability that individual 1 takes the modality 1 for the variable 1 or the modality 1 for the variable 2 The size of the rectangle agrees with the representation of the individual 1 in the Figure 3, because the value of the modality 1 for the variable 1 is 01 and the value of the modality 1 for the variable 2 is 02, ie the mean for the modality 1 is 015 The second rectangle of the projection of the individual 1 represents the probability that individual 1 takes the modality 1 or the modality 2 for the variable 1, or the probability that individual 1 takes the modality 1 or modality 2 for the variable 2 The size of the second rectangle also agrees with the representation of individual 1 in the Figure 3, because the value of the empirical distribution function for the modality 2 of the variable 1 is 05 and the value of the empirical distribution function for the modality 2 of the variable 2 is also 05 The third rectangle of individual 1 represents the probability 1, that is the probability that individual 1 takes any of the modalities
8 The smallest rectangle of the projection of individual 2 (Ind2) is bigger than the smallest rectangle of the projection of the individual 1 (see Figure 4); it is consistent with the interpretation, because the probability for individual 2 to take the modality 1 for the variable 1 is 07 and the probability for individual 2 to take the modality 1 for the variable 2 is 08, ie the mean of taken the modality 1 is 075 This value is bigger than the same value for individual 1 that is 15; that s why, the smallest rectangle of the projection of Ind1 is smaller than the smallest rectangle of the projection of Ind2 For the same reasons the second rectangle of the projection of Ind1 is smaller than the second rectangle of the projection of Ind2 Fig 4 Histogram Principal Component Plane References 1 Bertrand P et Goupil F Descriptive statistics for symbolic data, In Symbolic official data analysis, Springer, , Billard L and Diday E Regression analysis for interval value data, In data analysis,classification and related methods, Eds Kiers H, Rasson J, Groenen P and Schader M, IFCS Bock H-H and Diday E (eds) Analysis of Symbolic Data Exploratory methods for extracting statistical information from complex data Springer Verlag, Heidelberg, 425 pages, ISBN , Brito P Analyse de donnees symboliques: Pyramides d heritage, Thèse de doctorat, Université Paris IX Dauphine, 1991
9 5 Cazes P, Chouakria A, Diday E et Schektman Y Extension de l analyse en composantes principales à des données de type intervalle, Rev Statistique Appliquée, Vol XLV Num 3 pag 5-24, Francia, Chouakria A Extension des méthodes d analysis factorialle à des données de type intervalle, Thèse de doctorat, Université Paris IX Dauphine, Diday E Lemaire J, Pouget J, Testu F Eléments d Analyse des données Dunod, Paris, Diday E Introduction l approche symbolique en Analyse des Donnés Première Journées Symbolique-Numérique Université Paris IX Dauphine Décembre Diday E L Analyse des Donnes Symboliques: un cadre théorique et des outils Cahiers du CEREMADE, Diday E An Introduction to symbolic data analysis ans its application to the sodas project: purpose, history and perspective, Paris IX University Dauphine, Paris, Gettler Summa M Factorial axis interpretation by symbolic objects, Actes des Journées Symbolique-Numérique, Ed E Diday, Y Kodratoff, S Pinson Editeurs Univ Paris IX Dauphine 12 Rodríguez O, Symbolic correlation circle in principal components analysis, IFCS Rodríguez O, The Duality Problem in Symbolic Principal Components Analysis Also in this book 14 Pollaillon G Organisation et interprétation par les treillis de Galois de données de type multivalué, intervalle ou histogramme Thése de doctorat, Université Paris IX Dauphine, Saporta, G, L Analyse des Données Que sais-je? Presses Universitaires de France, Paris, 1980
complex data Edwin Diday, University i Paris-Dauphine, France CEREMADE, Beijing 2011
Symbolic data analysis of complex data Edwin Diday, CEREMADE, University i Paris-Dauphine, France Beijing 2011 OUTLINE What is the Symbolic Data Analysis (SDA) paradigm? Why SDA is a good tool for Complex
More informationINTERSTATIS: THE STATIS METHOD FOR INTERVAL VALUED DATA INTERSTATIS: EL MÉTODO STATIS PARA DATOS DE TIPO INTERVALO
REVISTA DE MATEMÁTICA: TEORÍA Y APLICACIONES 2014 21(1) : 73 83 CIMPA UCR ISSN:1409-2433 INTERSTATIS: THE STATIS METHOD FOR INTERVAL VALUED DATA INTERSTATIS: EL MÉTODO STATIS PARA DATOS DE TIPO INTERVALO
More informationDecision trees on interval valued variables
Decision trees on interval valued variables Chérif Mballo 1,2 and Edwin Diday 2 1 ESIEA Recherche, 38 Rue des Docteurs Calmette et Guerin 53000 Laval France mballo@esiea-ouest.fr 2 LISE-CEREMADE Université
More informationPrincipal Component Analysis with Interval Imputed missing values
Advances in Statistical Analysis manuscript No. (will be inserted by the editor) Principal Component Analysis with Interval Imputed missing values Paola Zuccolotto Received: date / Accepted: date Abstract
More informationModelling and Analysing Interval Data
Modelling and Analysing Interval Data Paula Brito Faculdade de Economia/NIAAD-LIACC, Universidade do Porto Rua Dr. Roberto Frias, 4200-464 Porto, Portugal mpbrito@fep.up.pt Abstract. In this paper we discuss
More informationPrincipal Component Analysis for Interval Data
Outline Paula Brito Fac. Economia & LIAAD-INESC TEC, Universidade do Porto ECI 2015 - Buenos Aires T3: Symbolic Data Analysis: Taking Variability in Data into Account Outline Outline 1 Introduction to
More informationA new linear regression model for histogram-valued variables
Int. Statistical Inst.: Proc. 58th World Statistical Congress, 011, Dublin (Session CPS077) p.5853 A new linear regression model for histogram-valued variables Dias, Sónia Instituto Politécnico Viana do
More informationA Resampling Approach for Interval-Valued Data Regression
A Resampling Approach for Interval-Valued Data Regression Jeongyoun Ahn, Muliang Peng, Cheolwoo Park Department of Statistics, University of Georgia, Athens, GA, 30602, USA Yongho Jeon Department of Applied
More informationDependencies in Interval-valued. valued Symbolic Data. Lynne Billard University of Georgia
Dependencies in Interval-valued valued Symbolic Data Lynne Billard University of Georgia lynne@stat.uga.edu Tribute to Professor Edwin Diday: Paris, France; 5 September 2007 Naturally occurring Symbolic
More informationPCR and PLS for Clusterwise Regression on Functional Data
PCR and PLS for Clusterwise Regression on Functional Data Cristian Preda 1 and Gilbert Saporta 2 1 Faculté de Médecine, Université de Lille 2 CERIM - Département de Statistique 1, Place de Verdun, 5945
More informationLasso-based linear regression for interval-valued data
Int. Statistical Inst.: Proc. 58th World Statistical Congress, 2011, Dublin (Session CPS065) p.5576 Lasso-based linear regression for interval-valued data Giordani, Paolo Sapienza University of Rome, Department
More informationDescriptive Statistics for Symbolic Data
Outline Descriptive Statistics for Symbolic Data Paula Brito Fac. Economia & LIAAD-INESC TEC, Universidade do Porto ECI 2015 - Buenos Aires T3: Symbolic Data Analysis: Taking Variability in Data into Account
More informationGeneralized Ward and Related Clustering Problems
Generalized Ward and Related Clustering Problems Vladimir BATAGELJ Department of mathematics, Edvard Kardelj University, Jadranska 9, 6 000 Ljubljana, Yugoslavia Abstract In the paper an attempt to legalize
More informationAssessing Sample Variability in the Visualization Techniques related to Principal Component Analysis : Bootstrap and Alternative Simulation Methods.
Draft from : COMPSTAT, Physica Verlag, Heidelberg, 1996, Alberto Prats, Editor, p 205 210. Assessing Sample Variability in the Visualization Techniques related to Principal Component Analysis : Bootstrap
More informationNotations and Abbreviations
Notations and Abbreviations Sets, relations, quantors Symbol argma:r ==> : 1\ v E rf. c c IAI or #A {x:... } or {x I... } c(a) or A Name relation relation product relation definition correspondence
More informationDistributions are the numbers of today From histogram data to distributional data. Javier Arroyo Gallardo Universidad Complutense de Madrid
Distributions are the numbers of today From histogram data to distributional data Javier Arroyo Gallardo Universidad Complutense de Madrid Introduction 2 Symbolic data Symbolic data was introduced by Edwin
More informationOn central tendency and dispersion measures for intervals and hypercubes
On central tendency and dispersion measures for intervals and hypercubes Marie Chavent, Jérôme Saracco To cite this version: Marie Chavent, Jérôme Saracco. On central tendency and dispersion measures for
More informationNEW ADVANCES IN SYMBOLIC DATA ANALYSIS and SPATIAL CLASSIFICATION. Edwin Diday Paris Dauphine University
NEW ADVANCES IN SYMBOLIC DATA ANALYSIS and SPATIAL CLASSIFICATION. Edwin Diday Paris Dauphine University Remembering Suzanne WINSBERG This talk is dedicated to her OUTLINE PART 1: SYMBOLIC DATA ANALYSIS
More informationA Nonparametric Kernel Approach to Interval-Valued Data Analysis
A Nonparametric Kernel Approach to Interval-Valued Data Analysis Yongho Jeon Department of Applied Statistics, Yonsei University, Seoul, 120-749, Korea Jeongyoun Ahn, Cheolwoo Park Department of Statistics,
More informationDiscriminant Analysis for Interval Data
Outline Discriminant Analysis for Interval Data Paula Brito Fac. Economia & LIAAD-INESC TEC, Universidade do Porto ECI 2015 - Buenos Aires T3: Symbolic Data Analysis: Taking Variability in Data into Account
More informationMultilevel Clustering for large Databases
Multilevel Clustering for large Databases Yves Lechevallier and Antonio Ciampi INRIA-Rocquencourt, 7853 Le Chesnay CEDEX, France Department of Epidemiology & Biostatistics, McGill University, Montreal,
More informationLong-term structural health monitoring of a steel road-rail bridge based on symbolic data analysis
Long-term structural health monitoring of a steel road-rail bridge based on symbolic data analysis A. D. Orcesi IFFSTAR, Paris, France A. Cury Paris-Est University, IFSTTAR, Paris, France C. F. Cremona
More informationDiscriminant Analysis on Mixed Predictors
Chapter 1 Discriminant Analysis on Mixed Predictors R. Abdesselam Abstract The processing of mixed data - both quantitative and qualitative variables - cannot be carried out as explanatory variables through
More informationMethods for territorial intelligence.
Methods for territorial intelligence. Serge Ormaux To cite this version: Serge Ormaux. Methods for territorial intelligence.. In International Conference of Territorial Intelligence, Sep 2006, Alba Iulia,
More informationStructural Segregation: Assessing the impact of South African Apartheid on Underlying Dynamics of Interactions between Networks and Territories
Structural Segregation: Assessing the impact of South African Apartheid on Underlying Dynamics of Interactions between Networks and Territories J. Raimbault 1,2, and S. Baffi 1,3 juste.raimbault@polytechnique.edu
More informationNew Clustering methods for interval data
New Clustering methods for interval data Marie Chavent, Francisco de A.T. De Carvahlo, Yves Lechevallier, Rosanna Verde To cite this version: Marie Chavent, Francisco de A.T. De Carvahlo, Yves Lechevallier,
More informationHistogram data analysis based on Wasserstein distance
Histogram data analysis based on Wasserstein distance Rosanna Verde Antonio Irpino Department of European and Mediterranean Studies Second University of Naples Caserta - ITALY Aims Introduce: New distances
More informationHistogram data analysis based on Wasserstein distance
Histogram data analysis based on Wasserstein distance Rosanna Verde Antonio Irpino Department of European and Mediterranean Studies Second University of Naples Caserta - ITALY SYMPOSIUM ON LEARNING AND
More informationGeneralizing Partial Least Squares and Correspondence Analysis to Predict Categorical (and Heterogeneous) Data
Generalizing Partial Least Squares and Correspondence Analysis to Predict Categorical (and Heterogeneous) Data Hervé Abdi, Derek Beaton, & Gilbert Saporta CARME 2015, Napoli, September 21-23 1 Outline
More informationFunctional Radial Basis Function Networks (FRBFN)
Bruges (Belgium), 28-3 April 24, d-side publi., ISBN 2-9337-4-8, pp. 313-318 Functional Radial Basis Function Networks (FRBFN) N. Delannay 1, F. Rossi 2, B. Conan-Guez 2, M. Verleysen 1 1 Université catholique
More informationProfiling and Labour Market Accessibility for the Graduates in Economics at Naples University
Profiling and Labour Market Accessibility for the Graduates in Economics at Naples University Simona Balbi, Maria Gabriella Grassia Department of Mathematics and Statistics, University of Naples Federico
More informationPermuting the partitions of a prime
Journal de Théorie des Nombres de Bordeaux 00 (XXXX), 000 000 Permuting the partitions of a prime par Stéphane VINATIER Résumé. Étant donné un nombre premier p impair, on caractérise les partitions l de
More informationOrder statistics for histogram data and a box plot visualization tool
Order statistics for histogram data and a box plot visualization tool Rosanna Verde, Antonio Balzanella, Antonio Irpino Second University of Naples, Caserta, Italy rosanna.verde@unina.it, antonio.balzanella@unina.it,
More informationSocial Network Analysis of the League of Nations Intellectual Cooperation, an Historical Distant Reading
Social Network Analysis of the League of Nations Intellectual Cooperation, an Historical Distant Reading Martin Grandjean To cite this version: Martin Grandjean. Social Network Analysis of the League of
More informationHow to cope with modelling and privacy concerns? A regression model and a visualization tool for aggregated data
for aggregated data Rosanna Verde (rosanna.verde@unina2.it) Antonio Irpino (antonio.irpino@unina2.it) Dominique Desbois (desbois@agroparistech.fr) Second University of Naples Dept. of Political Sciences
More informationStatistiques en grande dimension
Statistiques en grande dimension Christophe Giraud 1,2 et Tristan Mary-Huart 3,4 (1) Université Paris-Sud (2) Ecole Polytechnique (3) AgroParistech (4) INRA - Le Moulon M2 MathSV & Maths Aléa C. Giraud
More informationCOMPARATIVE STUDY BETWEEN LEMKE S METHOD AND THE INTERIOR POINT METHOD FOR THE MONOTONE LINEAR COMPLEMENTARY PROBLEM
STUDIA UNIV. BABEŞ BOLYAI, MATHEMATICA, Volume LIII, Number 3, September 2008 COMPARATIVE STUDY BETWEEN LEMKE S METHOD AND THE INTERIOR POINT METHOD FOR THE MONOTONE LINEAR COMPLEMENTARY PROBLEM ADNAN
More informationWorkshop. From data to classes and classes as new statistical units. UNIVERSITY PARIS-DAUPHINE January 10-11, 2019
Workshop ADVANCES IN DATA SCIENCE FOR BIG AND COMPLEX DATA From data to classes and classes as new statistical units UNIVERSITY PARIS-DAUPHINE January 10-11, 2019 ROOM C108 First floor In all domains of
More informationChapter 10 Conjoint Use of Variables Clustering and PLS Structural Equations Modeling
Chapter 10 Conjoint Use of Variables Clustering and PLS Structural Equations Modeling Valentina Stan and Gilbert Saporta Abstract In PLS approach, it is frequently assumed that the blocks of variables
More informationA DIFFERENT APPROACH TO MULTIPLE CORRESPONDENCE ANALYSIS (MCA) THAN THAT OF SPECIFIC MCA. Odysseas E. MOSCHIDIS 1
Math. Sci. hum / Mathematics and Social Sciences 47 e année, n 86, 009), p. 77-88) A DIFFERENT APPROACH TO MULTIPLE CORRESPONDENCE ANALYSIS MCA) THAN THAT OF SPECIFIC MCA Odysseas E. MOSCHIDIS RÉSUMÉ Un
More informationCYCLOGENESIS FORECASTING IN THE SOUTH INDIAN OCEAN BASIN 1 *MAXWELL Djaffard 1
ABSTRACT. CYCLOGENESIS FORECASTING IN THE SOUTH INDIAN OCEAN BASIN 1 *MAXWELL Djaffard 1 djafmax@yahoo.fr Miloud BESSAFI 2 bessafi@univ-reunion.fr RABEARISOA Jean Marc 1 haris_marc@yahoo.fr RAKOTOARISOA
More informationForecasting Complex Time Series: Beanplot Time Series
COMPSTAT 2010 19 International Conference on Computational Statistics Paris-France, August 22-27 Forecasting Complex Time Series: Beanplot Time Series Carlo Drago and Germana Scepi Dipartimento di Matematica
More informationPERFORMANCE OF THE EM ALGORITHM ON THE IDENTIFICATION OF A MIXTURE OF WAT- SON DISTRIBUTIONS DEFINED ON THE HYPER- SPHERE
REVSTAT Statistical Journal Volume 4, Number 2, June 2006, 111 130 PERFORMANCE OF THE EM ALGORITHM ON THE IDENTIFICATION OF A MIXTURE OF WAT- SON DISTRIBUTIONS DEFINED ON THE HYPER- SPHERE Authors: Adelaide
More informationSome Sparse Methods for High Dimensional Data. Gilbert Saporta CEDRIC, Conservatoire National des Arts et Métiers, Paris
Some Sparse Methods for High Dimensional Data Gilbert Saporta CEDRIC, Conservatoire National des Arts et Métiers, Paris gilbert.saporta@cnam.fr A joint work with Anne Bernard (QFAB Bioinformatics) Stéphanie
More informationInterval-Based Composite Indicators
University of Rome Niccolo Cusano Conference of European Statistics Stakeholders 22 November 2014 1 Building Composite Indicators 2 (ICI) 3 Constructing ICI 4 Application on real data Composite Indicators
More informationEntropy based principle and generalized contingency tables
Entropy based principle and generalized contingency tables Vincent Vigneron To cite this version: Vincent Vigneron Entropy based principle and generalized contingency tables 14th European Symposium on
More informationFuzzy histograms and density estimation
Fuzzy histograms and density estimation Kevin LOQUIN 1 and Olivier STRAUSS LIRMM - 161 rue Ada - 3439 Montpellier cedex 5 - France 1 Kevin.Loquin@lirmm.fr Olivier.Strauss@lirmm.fr The probability density
More informationArtificial Neural Networks for Modeling Partial Differential Equations Solution: Application to MicroSystems Simulation.
Artificial Neural Networks for Modeling Partial Differential Equations Solution: Application to MicroSystems Simulation. Efrain Jaime Ang 1, Bruno Jammes 1.- Universidad Autonoma de Tamaulipas, Mexico
More informationarxiv: v1 [stat.ml] 23 Jul 2015
CLUSTERING OF MODAL VALUED SYMBOLIC DATA VLADIMIR BATAGELJ, NATAŠA KEJŽAR, AND SIMONA KORENJAK-ČERNE UNIVERSITY OF LJUBLJANA, SLOVENIA arxiv:157.6683v1 [stat.ml] 23 Jul 215 Abstract. Symbolic Data Analysis
More informationMulti-Layer Perceptrons for Functional Data Analysis: a Projection Based Approach 0
Multi-Layer Perceptrons for Functional Data Analysis: a Projection Based Approach 0 Brieuc Conan-Guez 1 and Fabrice Rossi 23 1 INRIA, Domaine de Voluceau, Rocquencourt, B.P. 105 78153 Le Chesnay Cedex,
More informationEstimation of the transmissmty of the Santiago aquifer, Chile, using different geostatistical methods
Groundwater Management: Quantity and Quality (Proceedings of the Benidorm Symposium, October 1989). IAHS Publ. no. 188,1989. Estimation of the transmissmty of the Santiago aquifer, Chile, using different
More informationA NON-PARAMETRIC TEST FOR NON-INDEPENDENT NOISES AGAINST A BILINEAR DEPENDENCE
REVSTAT Statistical Journal Volume 3, Number, November 5, 155 17 A NON-PARAMETRIC TEST FOR NON-INDEPENDENT NOISES AGAINST A BILINEAR DEPENDENCE Authors: E. Gonçalves Departamento de Matemática, Universidade
More informationFunctional Preprocessing for Multilayer Perceptrons
Functional Preprocessing for Multilayer Perceptrons Fabrice Rossi and Brieuc Conan-Guez Projet AxIS, INRIA, Domaine de Voluceau, Rocquencourt, B.P. 105 78153 Le Chesnay Cedex, France CEREMADE, UMR CNRS
More informationThree-way component models for imprecise data
Three-way component models for imprecise data Paolo Giordani paolo.giordani@uniroma1.it Sapienza University of Rome TRICAP, Vall de Núria Spain, June 15th, 2009 Outline 1 Uncertainty Some clarifications
More informationAn Adjoint Approach for Stabilizing the Parareal Method
An Adjoint Approach for Stabilizing the Parareal Method Feng Chen a, Jan S Hesthaven b, Yvon Maday c, Allan Nielsen b a Department of Mathematics, Baruch College, New York, NY 10010, USA b Mathematics
More informationMultidimensional Joint Graphical Display of Symmetric Analysis: Back to the Fundamentals
Multidimensional Joint Graphical Display of Symmetric Analysis: Back to the Fundamentals Shizuhiko Nishisato Abstract The basic premise of dual scaling/correspondence analysis lies in the simultaneous
More informationOn the uniform Poincaré inequality
On the uniform Poincaré inequality Abdesslam oulkhemair, Abdelkrim Chakib To cite this version: Abdesslam oulkhemair, Abdelkrim Chakib. On the uniform Poincaré inequality. Communications in Partial Differential
More informationPLS discriminant analysis for functional data
PLS discriminant analysis for functional data 1 Dept. de Statistique CERIM - Faculté de Médecine Université de Lille 2, 5945 Lille Cedex, France (e-mail: cpreda@univ-lille2.fr) 2 Chaire de Statistique
More informationDoubled patterns are 3-avoidable
Doubled patterns are 3-avoidable arxiv:1510.01753v1 [cs.dm] 6 Oct 2015 Pascal Ochem LIRMM, Université de Montpellier, CNRS Montpellier, France ochem@lirmm.fr August 31, 2018 Abstract In combinatorics on
More informationA Descent Algorithm for Solving Variational Inequality Problems
Applied Mathematical Sciences, Vol. 2, 2008, no. 42, 2095-2103 A Descent Algorithm for Solving Variational Inequality Problems Zahira Kebaili and A. Krim Keraghel Department of Mathematics, Sétif University
More informationVisualizing Dessins D Enfants
Willamette Valley Consortium for Mathematics Research Occidental College and Ave Maria University MAA MathFest August 7, 2014 Introduction Belyi Maps Dessins Passports Shabat Polynomials Question Example
More informationStructural Plots of Multivariate Binary Data
Structural Plots of Multivariate Binary Data Ove Frank Department of Statistics Stockholm University Ove.Frank@stat.su.se ABSTRACT: Data structures comprising many binary variables can be represented graphically
More informationA set of formulas for primes
A set of formulas for primes by Simon Plouffe December 31, 2018 Abstract In 1947, W. H. Mills published a paper describing a formula that gives primes : if A 1.3063778838630806904686144926 then A is always
More informationA Goodness-of-Fit Measure for the Mokken Double Monotonicity Model that Takes into Account the Size of Deviations
Methods of Psychological Research Online 2003, Vol.8, No.1, pp. 81-101 Department of Psychology Internet: http://www.mpr-online.de 2003 University of Koblenz-Landau A Goodness-of-Fit Measure for the Mokken
More informationInnovation and Regional Growth in the European Union
Riccardo Crescenzi Andres Rodriguez-Pose Innovation and Regional Growth in the European Union Springer Contents 1 Introduction 1 2 Theoretical Framework: A Spatial Perspective On Innovation and the Genesis
More informationHigh Dimensional Discriminant Analysis
High Dimensional Discriminant Analysis Charles Bouveyron 1,2, Stéphane Girard 1, and Cordelia Schmid 2 1 LMC IMAG, BP 53, Université Grenoble 1, 38041 Grenoble cedex 9 France (e-mail: charles.bouveyron@imag.fr,
More informationAvailable online at International Journal of Current Research Vol. 7, Issue, 11, pp , November, 2015
z Available online at http://www.journalcra.com International Journal of Current Research Vol. 7, Issue, 11, pp.23151-23157, November, 2015 INTERNATIONAL JOURNAL OF CURRENT RESEARCH ISSN: 0975-833X RESEARCH
More informationA set of formulas for primes
A set of formulas for primes by Simon Plouffe December 31, 2018 Abstract In 1947, W. H. Mills published a paper describing a formula that gives primes : if A 1.3063778838630806904686144926 then A is always
More informationRINGS IN POST ALGEBRAS. 1. Introduction
Acta Math. Univ. Comenianae Vol. LXXVI, 2(2007), pp. 263 272 263 RINGS IN POST ALGEBRAS S. RUDEANU Abstract. Serfati [7] defined a ring structure on every Post algebra. In this paper we determine all the
More informationInvestigation of damage mechanisms of polymer concrete: Multivariable analysis based on temporal features extracted from acoustic emission signals
Investigation of damage mechanisms of polymer concrete: Multivariable analysis based on temporal features extracted from acoustic emission signals Anne MAREC, Rachid BERBAOUI, Jean-Hugh TOMAS, Abderrahim
More informationHow smooth is the smoothest function in a given refinable space? Albert Cohen, Ingrid Daubechies, Amos Ron
How smooth is the smoothest function in a given refinable space? Albert Cohen, Ingrid Daubechies, Amos Ron A closed subspace V of L 2 := L 2 (IR d ) is called PSI (principal shift-invariant) if it is the
More informationProximity Measures for Data Described By Histograms Misure di prossimità per dati descritti da istogrammi Antonio Irpino 1, Yves Lechevallier 2
Proximity Measures for Data Described By Histograms Misure di prossimità per dati descritti da istogrammi Antonio Irpino 1, Yves Lechevallier 2 1 Dipartimento di Studi Europei e Mediterranei, Seconda Università
More informationEstimation Lasso et interactions poissoniennes sur le cercle
Estimation Lasso et interactions poissoniennes sur le cercle Laure SANSONNET Université catholique de Louvain, Belgique J-3 : AgroParisTech en collaboration avec : Patricia Reynaud-Bouret (Université Nice
More informationDESIGN AND ANALYSIS OF CAM MECHANISM WITH TRANSLATING FOLLOWER HAVING DOUBLE ROLLERS. Jung-Fa Hsieh
DESIGN AND ANALYSIS OF CAM MECHANISM WITH TRANSLATING FOLLOWER HAVING DOUBLE ROLLERS Jung-Fa Hsieh Far East University Department of Mechanical Engineering Tainan, 74448, Taiwan E-mail: seznof@cc.feu.edu.tw
More informationNONLINEAR PRINCIPAL COMPONENT ANALYSIS
NONLINEAR PRINCIPAL COMPONENT ANALYSIS JAN DE LEEUW ABSTRACT. Two quite different forms of nonlinear principal component analysis have been proposed in the literature. The first one is associated with
More informationAccurate critical exponents from the ϵ-expansion
Accurate critical exponents from the ϵ-expansion J.C. Le Guillou, J. Zinn-Justin To cite this version: J.C. Le Guillou, J. Zinn-Justin. Accurate critical exponents from the ϵ-expansion. Journal de Physique
More informationLinear Regression Model with Histogram-Valued Variables
Linear Regression Model with Histogram-Valued Variables Sónia Dias 1 and Paula Brito 1 INESC TEC - INESC Technology and Science and ESTG/IPVC - School of Technology and Management, Polytechnic Institute
More informationPackage RSDA. August 29, 2013
Package RSDA August 29, 2013 Type Package Title RSDA - R to Symbolic Data Analysis Version 1.1 Date 2013-06-29 Author Oldemar Rodriguez R. with contributions from Johnny Villalobos M. Maintainer Oldemar
More informationReview. DS GA 1002 Statistical and Mathematical Models. Carlos Fernandez-Granda
Review DS GA 1002 Statistical and Mathematical Models http://www.cims.nyu.edu/~cfgranda/pages/dsga1002_fall16 Carlos Fernandez-Granda Probability and statistics Probability: Framework for dealing with
More informationEstimation of the extreme value index and high quantiles under random censoring
Estimation of the extreme value index and high quantiles under random censoring Jan Beirlant () & Emmanuel Delafosse (2) & Armelle Guillou (2) () Katholiee Universiteit Leuven, Department of Mathematics,
More informationCALCULATING THE VOLTAGE ACROSS A TURNED OFF SEMI- CONDUCTOR MODELLED BY PERFECT SWITCHES
CALCULATING THE VOLTAGE ACROSS A TURNED OFF SEMI- CONDUCTOR MODELLED BY PERFECT SWITCHES Faouzi BOULOS Department of Physics University Lebanese of Science II Fanar BP 90656, Jedeidet-El-Maten, Lebanon
More informationA multiple regression model for predicting rattle noise subjective rating from in-car microphones measurements
A multiple regression model for predicting rattle noise subjective rating from in-car microphones measurements B. Gauduin a, C. Noel a, J.-L. Meillier b and P. Boussard a a Genesis S.A., Bâtiment Gérard
More informationA DUALITY ALGORITHM FOR THE OBSTACLE PROBLEM
Ann. Acad. Rom. Sci. Ser. Math. Appl. ISSN 2066-6594 Vol. 5, No. -2 / 203 A DUALITY ALGORITHM FOR THE OBSTACLE PROBLEM Diana Merluşcă Abstract We consider the obstacle problem in Sobolev spaces, of order
More informationSELF-ADJOINTNESS OF DIRAC OPERATORS VIA HARDY-DIRAC INEQUALITIES
SELF-ADJOINTNESS OF DIRAC OPERATORS VIA HARDY-DIRAC INEQUALITIES MARIA J. ESTEBAN 1 AND MICHAEL LOSS Abstract. Distinguished selfadjoint extension of Dirac operators are constructed for a class of potentials
More informationPROPOSITIONAL MIXED LOGIC: ITS SYNTAX AND SEMANTICS
PROPOSITIONAL MIXED LOGIC: ITS SYNTAX AND SEMANTICS Karim NOUR 1 and Abir NOUR 2 Abstract In this paper, we present a propositional logic (called mixed logic) containing disjoint copies of minimal, intuitionistic
More informationLa question posée (en français, avec des mots justes ; pour un calcul, l'objectif doit être clairement écrit formellement)
Exercise : You have to make one ton of mayonnaise sauce using 95 % oil, 2.5 % egg yolk, 2.5 % vinegar. What is the minimum energy that you have to spend? Calculation for mayonnaise Hervé 4th October 2013
More informationProbability and Statistics. Volume II
Probability and Statistics Volume II Didier Dacunha-Castelle Marie Duflo Probability and Statistics Volume II Translated by David McHale Springer-Verlag New York Berlin Heidelberg Tokyo Didier Dacunha-Castelle
More informationConcepts of a Discrete Random Variable
Concepts of a Discrete Random Variable Richard Emilion Laboratoire MAPMO, Université d Orléans, B.P. 6759 45067 Orléans Cedex 2, France, richard.emilion@univ-orleans.fr Abstract. A formal concept is defined
More information1 Introduction CONVEXIFYING THE SET OF MATRICES OF BOUNDED RANK. APPLICATIONS TO THE QUASICONVEXIFICATION AND CONVEXIFICATION OF THE RANK FUNCTION
CONVEXIFYING THE SET OF MATRICES OF BOUNDED RANK. APPLICATIONS TO THE QUASICONVEXIFICATION AND CONVEXIFICATION OF THE RANK FUNCTION Jean-Baptiste Hiriart-Urruty and Hai Yen Le Institut de mathématiques
More informationInverse function theorems
Inverse function theorems Ivar Ekeland CEREMADE, Université Paris-Dauphine Ravello, May 2011 Ivar Ekeland (CEREMADE, Université Paris-Dauphine) Inverse function theorems Ravello, May 2011 1 / 18 Ekeland
More informationGraduate Texts in Mathematics 42. Editorial Board. F. W. Gehring P. R. Halmos Managing Editor. c. C. Moore
Graduate Texts in Mathematics 42 Editorial Board F. W. Gehring P. R. Halmos Managing Editor c. C. Moore Jean-Pierre Serre Linear Representations of Finite Groups Translated from the French by Leonard L.
More informationThe Story of a Research About the Nets of Platonic Solids with Cabri 3D: Conjectures Related to a Special Net Factor A Window for New Researches
The Story of a Research About the Nets of Platonic Solids with Cabri 3D: Conjectures Related to a Special Net Factor A Window for New Researches Jean-Jacques Dahan jjdahan@wanadoo.fr IREM of Toulouse Paul
More informationA Short Course in Basic Statistics
A Short Course in Basic Statistics Ian Schindler November 5, 2017 Creative commons license share and share alike BY: C 1 Descriptive Statistics 1.1 Presenting statistical data Definition 1 A statistical
More informationDIETRICH BURDE. do not hold for all x 30, contrary to some references. These estimates on π(x) came up recently in papers on algebraic number theory.
A REMARK ON AN INEQUALITY FOR THE PRIME COUNTING FUNCTION DIETRICH BURDE log() Abstract. We note that the inequalities 0.9 < π() < 1.11 log() do not hold for all 30, contrary to some references. These
More informationORGANISATION FOR ECONOMIC CO-OPERATION AND DEVELOPMENT
ORGANISATION FOR ECONOMIC CO-OPERATION AND DEVELOPMENT Pursuant to Article 1 of the Convention signed in Paris on 14th December 1960, and which came into force on 30th September 1961, the Organisation
More informationA note on top-k lists: average distance between two top-k lists
A note on top-k lists: average distance between two top-k lists Antoine Rolland Univ Lyon, laboratoire ERIC, université Lyon, 69676 BRON, antoine.rolland@univ-lyon.fr Résumé. Ce papier présente un complément
More informationMallows L 2 Distance in Some Multivariate Methods and its Application to Histogram-Type Data
Metodološki zvezki, Vol. 9, No. 2, 212, 17-118 Mallows L 2 Distance in Some Multivariate Methods and its Application to Histogram-Type Data Katarina Košmelj 1 and Lynne Billard 2 Abstract Mallows L 2 distance
More informationapproximation results for the Traveling Salesman and related Problems
approximation results for the Traveling Salesman and related Problems Jérôme Monnot To cite this version: Jérôme Monnot. approximation results for the Traveling Salesman and related Problems. Information
More information50 Years of Data Analysis
50 Years of Data Analysis from EDA to predictive modelling and machine learning Gilbert Saporta CEDRIC- CNAM, 292 rue Saint Martin, F-75003 Paris gilbert.saporta@cnam.fr http://cedric.cnam.fr/~saporta
More information1. What subatomic particles do these symbols represent in Model 1?
Coulombic attraction is the attraction between oppositely charged particles. For example, the protons in the nucleus of an atom have attraction for the electrons surrounding the nucleus. This is because
More information