Clustering and Blockmodeling

Similar documents
Generalized Ward and Related Clustering Problems

Generalized Blockmodeling with Pajek

Centrality in Social Networks

Mallows L 2 Distance in Some Multivariate Methods and its Application to Histogram-Type Data

NP-hardness results for linear algebraic problems with interval data

UNIVERZA V LJUBLJANI FAKULTETA ZA DRUŽBENE VEDE ALEŠ ŽIBERNA

Peripheries of cohesive subsets

Generalized Blockmodeling of Valued Networks

Peripheries of Cohesive Subsets

1 Introduction and Results

Structural Plots of Multivariate Binary Data

arxiv: v1 [stat.ml] 27 Nov 2011

OPTIMIZATION TECHNIQUES FOR EDIT VALIDATION AND DATA IMPUTATION

Okunoye Babatunde O. Department of Pure and Applied Biology, Ladoke Akintola University of Technology, Ogbomoso, Nigeria.

Matrix period in max-drast fuzzy algebra

Positional Analysis in Fuzzy Social Networks

Universality class of triad dynamics on a triangular lattice

How to count trees? arxiv:cond-mat/ v3 [cond-mat.stat-mech] 7 Apr 2005

Fast Hierarchical Clustering from the Baire Distance

Fractal Dimension and Vector Quantization

THE IMPACT OF ICT GROWTH ON HOUSEHOLDS AND MUNICIPALITIES IN THE CZECH NUTS-3 REGIONS: THE APPLICATION OF CLUSTER ANALYSIS

Hadamard matrices of order 32

Visual Network Analysis

Discrete Applied Mathematics

Minimum Cost Homomorphisms to Semicomplete Multipartite Digraphs

University of Twente. Faculty of Mathematical Sciences. Scheduling split-jobs on parallel machines. University for Technical and Social Sciences

ON THE METRIC DIMENSION OF CIRCULANT GRAPHS WITH 4 GENERATORS

Maximizing the Cohesion is NP-hard

Gearing optimization

The spectra of super line multigraphs

Recall Versus Recognition : Comparison of the Two Alternative Procedures for Collecting Social Network Data

MAT 224: Foundations of Higher Mathematics

INVERTING SIGNED GRAPHS*

Lattices and dimensional representations: matrix decompositions and ordering structures

On the consensus of closure systems

GYULA FARKAS WOULD ALSO FEEL PROUD

The Multiple Traveling Salesman Problem with Time Windows: Bounds for the Minimum Number of Vehicles

Finite Presentations of Pregroups and the Identity Problem


Discrete Applied Mathematics

Volume 35, Issue 4. Note on goodness-of-fit measures for the revealed preference test: The computational complexity of the minimum cost index

A Design and a Code Invariant

A unique representation of polyhedral types. Centering via Möbius transformations

NOTES ON THE INDEPENDENCE NUMBER IN THE CARTESIAN PRODUCT OF GRAPHS

Network Data: an overview

Cluster Graph Modification Problems

M.Sc Geo-informatics Semester II Paper-VI : Advanced Course on Human Geography

Polya theory (With a radar application motivation)

CONTROL SYSTEMS, ROBOTICS AND AUTOMATION Vol. XI Stochastic Stability - H.J. Kushner

Hamilton Cycles in Digraphs of Unitary Matrices

Reconstructing subgraph-counting graph polynomials of increasing families of graphs

Histogram data analysis based on Wasserstein distance

Single Machine Scheduling with a Non-renewable Financial Resource

A new linear regression model for histogram-valued variables

A f = A f (x)dx, 55 M F ds = M F,T ds, 204 M F N dv n 1, 199 !, 197. M M F,N ds = M F ds, 199 (Δ,')! = '(Δ)!, 187

SYSTEM EVALUATION AND DESCRIPTION USING ABSTRACT RELATION TYPES (ART)

On minors of the compound matrix of a Laplacian

The Local Spectra of Regular Line Graphs

SCIENCE CITATION INDEX - MATHEMATICS, APPLIED - JOURNAL LIST Total journals: 79

Histogram data analysis based on Wasserstein distance

THERE IS NO FINITELY ISOMETRIC KRIVINE S THEOREM JAMES KILBANE AND MIKHAIL I. OSTROVSKII

Distribution of Global Defensive k-alliances over some Graph Products

Resolving infeasibility in extremal algebras 1

Enumerating the edge-colourings and total colourings of a regular graph

Understanding (dis)similarity measures

Decision trees on interval valued variables

Abbott, A Sequence analysis: new methods for old ideas. Annual Review of Sociology 21:

UNIVERSITÀ DEGLI STUDI DI ROMA LA SAPIENZA

Two Further Gradient BYY Learning Rules for Gaussian Mixture with Automated Model Selection

Czechoslovak Mathematical Journal

Construction of `Wachspress type' rational basis functions over rectangles

Distance in Graphs - Taking the Long View

Groups of vertices and Core-periphery structure. By: Ralucca Gera, Applied math department, Naval Postgraduate School Monterey, CA, USA

ADVANCES IN NETWORK CLUSTERING AND BLOCKMODELING

Small-Area Population Forecasting Using a Spatial Regression Approach

Memoirs on Differential Equations and Mathematical Physics

QR FACTORIZATIONS USING A RESTRICTED SET OF ROTATIONS

Spectral results on regular graphs with (k, τ)-regular sets

BAYESIAN MODEL FOR SPATIAL DEPENDANCE AND PREDICTION OF TUBERCULOSIS

PDF hosted at the Radboud Repository of the Radboud University Nijmegen

Hybrid Transition Modes in (Tissue) P Systems

Neighborly families of boxes and bipartite coverings

Mathematica Slovaca. Dragan Marušič; Tomaž Pisanski The remarkable generalized Petersen graph G(8, 3) Terms of use:

Computing the (k-)monopoly number of direct product of graphs

On Hamiltonian cycles and Hamiltonian paths

Computing distance moments on graphs with transitive Djoković-Winkler s relation

PREFERENCE MATRICES IN TROPICAL ALGEBRA

Combinatorial Algorithms for Minimizing the Weighted Sum of Completion Times on a Single Machine

The Solvability Conditions for the Inverse Eigenvalue Problem of Hermitian and Generalized Skew-Hamiltonian Matrices and Its Approximation

Deciding Emptiness of the Gomory-Chvátal Closure is NP-Complete, Even for a Rational Polyhedron Containing No Integer Point

Fly Cheaply: On the Minimum Fuel Consumption Problem

A Picture for Complex Stochastic Boolean Systems: The Intrinsic Order Graph

Title fibring over the circle within a co. Citation Osaka Journal of Mathematics. 42(1)

On the eigenvalues of Euclidean distance matrices

ON THE CONTINUITY OF GLOBAL ATTRACTORS

Node similarity and classification

Longitudinal Data Analysis. Michael L. Berbaum Institute for Health Research and Policy University of Illinois at Chicago

(Kac Moody) Chevalley groups and Lie algebras with built in structure constants Lecture 1. Lisa Carbone Rutgers University

Markovian Combination of Decomposable Model Structures: MCMoSt

Constrained controllability of semilinear systems with delayed controls

Transcription:

Clustering and Blockmodeling Vladimir Batagelj University of Ljubljana, Slovenia June 28, 2002 References [1] Aarts, E. & Lenstra, J.K. (Eds.)(1997). Local Search in Combinatorial Optimization. Wiley, Chichester. [2] Anderberg, M.R. (1973), Cluster analysis for applications. Probability and mathematical statistics, vol.19. Academic Press, New York. [3] Batagelj, V. (1981), Note on ultrametric hierarchical clustering algorithms. Psychometrika, 46(3), 351-352. [4] Batagelj, V. (1984), Notes on the dynamic clusters method, in Proceedings of the 4th Conference on Applied Mathematics 1984, Split, University of Split, 1985, 139-146. [5] Batagelj, V. (1985), Algorithmic aspects of clustering problem, Proceedings of 7th International Symposium Computer at the University, Cavtat 1985, SRCE, Zagreb, p. 502 1-502 15. [6] Batagelj, V. (1986), On adding clustering algorithms. COMPSTAT 86, short communications, Rome. [7] Batagelj, V. (1988), Generalized Ward and related clustering problems. In: Bock, H.H. (Ed.): Classification and related methods of data analysis, North- Holland, Amsterdam, p. 67 74. [8] Batagelj, V. (1989), Similarity measures between structured objects, in A. Graovac (Ed.), Proceedings MATH/CHEM/COMP 1988, Dubrovnik, Yugoslavia 20-25 June 1988, Studies in Physical and Theoretical Chemistry. Vol 63, pp. 25-40, Amsterdam: Elsevier. 1

[9] Batagelj, V., Pisanski, T., and Simões-Pereira, J.M.S. (1990), An algorithm for tree-realizability of distance matrices. International Journal of Computer Mathematics, 34, pp. 171-176. [10] Batagelj, V., Ferligoj, A., and Doreian, P. (1992), Direct and Indirect Methods for Structural Equivalence. Social Networks 14, 63-90. [11] Batagelj, V., Doreian, P., and Ferligoj, A. (1992), An Optimizational Approach to Regular Equivalence. Social Networks 14, 121-135. [12] Batagelj, V., Korenjak-Černe, S., and Klavžar, S. (1994) Dynamic Programming and Convex Clustering. Algorithmica 11, 93-103. [13] Batagelj, V. (1994), Semirings for Social Networks Analysis. Journal of Mathematical Socilogy, 191, 53-68. [14] Batagelj, V., and Bren, M. (1995), Comparing Resemblance Measures, Journal of Classification, 12(1), 73-90. [15] Batagelj, V. (1997), Notes on blockmodeling. Social Networks 19, 143-155. [16] Batagelj, V., Mrvar, A. and Zaveršnik, M. (1999), Partitioning approach to visualization of large graphs, In Kratochvíl, J. (ed), Lecture notes in computer science, 1731, Springer, Berlin, 90 97. [17] Batagelj, V., and Mrvar, A. (2001), A subquadratic triad census algorithm for large sparse networks with small maximum degree. Social networks, 23(3), 237-243. [18] Batagelj, V. and Zaveršnik, M. (2002) Generalized cores, submitted. [19] Batagelj, V. and Zaveršnik, M. (2002) Triangular connectivity and its generalizations, in preparation. [20] Benzécri, J.P. (1976), L analyse des donnees, Tome 1., 2., Paris: Dunod. [21] Bock, H.H. and Diday, E., (Eds.) (2000), Analysis of Symbolic Data. Exploratory methods for extracting statistical information from complex data. Springer, Heidelberg. [22] Borgatti, S.P., and Everett, M.G. (1989), The Class of all Regular Equivalences: Algebraic Structure and Computation, Social Networks, 11, 65-88. [23] Borgatti, S.P., and Everett, M.G. (1992). Notions of positions in social network analysis. In P. V. Marsden (Ed.), Sociological Methodology (pp. 1 35). San Francisco: Jossey-bass. 2

[24] Boyd, J.P. (1991). Social Semigroups. A Unified Theory of Scaling and Blockmodelling as Applied to Social Networks. Fairfax: George Mason University Press. [25] Brigham, R.C., and Dutton, R.D. (1985). A compilation of relations between graph invariants. Networks, 15, 73 107. [26] Brucker, P. (1978), On the complexity of clustering problems, in Optimization and Operations Research, Proceedings, Bonn 1977, in Lecture Notes in Economics and Mathematical Systems, Vol. 157, eds. R.Henn, B. Korte and W. Oettli, Berlin: Springer-Verlag. [27] Bruynooghe, M. (1977), Méthodes nouvelles en classification automatique des données taxinomiques nombreuses. Statistique et Analyse des Données, 3, 24-42. [28] Carré, B. (1979). Graphs and Networks. Oxford: Clarendon. [29] E. Chávez, G. Navarro, R. Baeza-Yates, and J. Marroqu in (1999). Searching in Metric Spaces. Technical Report TR/DCC-99-3, Dept. of Computer Science, Univ. of Chile. http://citeseer.nj.nec.com/avez99searching.html [30] De Nooy, W., Mrvar A., Batagelj, V. (2002), Exploratory Social Network Analysis with Pajek. Manuscript of a textbook. [31] Degenne, A., and Forseé, M. (1999), Introducing social networks, London: Sage. [32] Dickerson M.T., and Eppstein D. (1996), Algorithms for proximity problems in higher dimensions. Comp. Geom. Theory & Applications 5, 277-291. http://www.ics.uci.edu/ eppstein/pubs/ a-dickerson.html [33] Diday, E. et al. (1979), Optimisation en classification automatique, Tome 1., 2., Rocquencourt: INRIA. [34] Dieudonne, J. (1960), Foundations of modern analysis. Academic Press, New York. [35] Doreian, P., Batagelj, V., Ferligoj, A. (2000), Symmetric-acyclic decompositions of networks. J. classif., 17(1), 3-28. 3

[36] Doreian, P., Batagelj, V., Ferligoj, A. (2001), Positional analyses of sociometric data. Chapter for the book: Models and methods in social network analysis. New York: Cambridge University Press (forthcoming). [37] Doreian, P., Batagelj, V., Ferligoj, A. (2002), Generalized blockmodeling. Manuscript of a book. [38] Doreian, P., Fujimoto K. (2001), Structures of Supreme Court Voting. University of Pittsburgh, manuscript, version November 3, 2001. [39] Doreian, P., and Mrvar, A. (1996). A partitioning approach to structural balance. Social Networks, 18, 149 168. [40] Esfahanian A-H., On the Evolution of Graph Connectivity Algorithms. http://www.cse.msu.edu/ esfahani/ book chapter/graph connectivity chapter.pdf [41] Everett, M.G. (1983), EBLOC: A graph theoretic blocking algorithm for social networks. Social Networks 5, 323-346. [42] Everett, M.E and Borgatti, S.P. (1996), Exact Colorations of Graphs and Digraphs. Social Networks, 18, 319-331. [43] Everitt, B. (1974), Cluster Analysis. Heinemann Educational Books LTD., London. [44] Fuast, K. (1988), Comparison of Methods for Positional Analysis: Structural and General Equivalences, Social Networks, 10, 313-341. [45] Ferligoj A., Batagelj V. (1982), Clustering with relational constraint. Psychometrika, 47(4), 413-426. [46] Ferligoj A., Batagelj V. (1983), Some types of clustering with relational constraints. Psychometrika, 48(4), 541-552. [47] Ferligoj, A., and Batagelj, V. (1992), Direct Multicriteria Clustering Algorithms. Journal of Classification, 9(1), 43-61. [48] Fisher, W.D. (1958), On grouping for maximum homogeneity. Journal of the American Statistical Association, 53, 789-798. [49] Fukunaga, K., Narendra, P.M. (1975), A branch and bound algorithm for computing k-nearest neighbors. IEEE Transactions on Computers, C-24, 750-753. 4

[50] Ganti V. etal. (1998), Clustering large datasets in arbitrary metric spaces. http://www.cs.wisc.edu/ vganti/pubs.html [51] Garey, M.R. and Johnson, D.S. (1979), Computer and intractability, San Francisco: Freeman. [52] Godehardt, E. (1988), Graphs as Structural Models, Brounschweig: Vieweg. [53] Gordon, A.D.(1981), Classification. Monographs on Applied Probability and Statistics. Chapman and Hall, London. [54] Gordon, A.D. (1996). A survey of constrained classification. Computational Statistics & Data Analysis, 21, 17 29. [55] Gower, J.C.(1971), A general coefficient of similarity and some of its properties. Biometrics 27, 857-871. [56] Harary, F, Norman, R.Z., and Cartwright, D. (1965), Structural Models: An Introduction to the Theory of Directed Graphs, New York: Wiley. [57] Harel, D., and Koren, J. (2001), On Clustering Using Random Walks, LNCS 2245, Berlin: Springer-Verlag, 18-41. [58] Hartigan, J.A. (1975), Clustering Algorithms, New York: John-Wiley. [59] Hubálek, Z. (1982), Coefficients of association and similarity, based on binary (presence-absence) data: an evaluation, Biological Review 57, 669-689. [60] Hummon, N.P. & Doreian, P. (1989). Connectivity in a citation network: The development of dna theory. Social Networks, 11, 39 63. [61] Jambu M., Lebeaux M-O. (1983), Cluster analysis and data analysis, North- Holland, Amsterdam. [62] Janowitz, M.F. (1978). An order theoretic model for cluster analysis. SIAM J. Appl. Math., 34, 55 72. [63] Jardin N., Sibson R.(1971), Mathematical taxonomy. John Wiley, London. [64] Jensen R.E. (1969), A dynamic programming algorihm for cluster analysis. Operational research, 7, 1034-1057. 5

[65] Kamvar, S.D., and Klein, D., and Manning, C.D. (2002), Interpreting and Extending Classical Agglomerative Clustering Algorithms using a Model- Based Approach. ICML 2002. http://nlp.stanford.edu/ danklein/ project-clustering.shtml [66] Kashyap R.L., Oommen B.J. (1983), A common basis for similarity measures involving two strings. International Journal of Computer Mathematics 13, 17-40. [67] Korenjak-Černe S. and Batagelj V. (1998), Clustering large datasets of mixed units, in Rizzi, A., Vichi, M., Bock, H.-H. (Eds.): Advances in Data Science and Classification. Springer, Berlin, p. 43 48. [68] Levenshtein V.I. (1966), Binary codes capable of correcting deletions, insertions, and reversals. Soviet Physics - Doklady, 10, 707-710. [69] Lorrain, F. and H.C. White (1971): Structural equivalence of individuals in social networks. Journal of Mathematical Sociology, 1:49 80. [70] Lorrain, F. (1975): Réseaux sociaux et classifications sociales. Paris: Hermann. [71] Marriott, F.H.C. (1982), Optimization methods of cluster analysis, Biometrika, 69, 417-421. [72] Matula D.W.(1977) Graph theoretic techniques for cluster analysis algorithms. Van Ryzin J. (ed.): Classification and clustering. Academic Press, New York, 95-127. [73] Mirkin, B.G. (1980), Analiz kačestvennyh priznakov i struktur, Moskva: Statistika. [74] Mirkin, B.G., and Rodin C.N. (1977), Grafy i geny, Moskva: Nauka. [75] Moody, J. (2001), Peer influence groups: identifying dense clusters in large networks. Social Networks 23, 261-283. [76] Moreno, J.L. (1953), Who shall survive?, Beacon N.Y.: Beacon House. [77] Murtagh, F. (1985), Multidimensional Clustering Algorithms, Compstat lectures, 4, Vienna: Physica-Verlag. 6

[78] Murtagh, F. (1999), Clustering in Massive Data Sets, Proceedings of the Beilstein Workshop 2000: Chemical Data Analysis in the Large, May 22nd - 26th 2000, Bozen, Italy, p. 28-51 http://www.beilstein-institut.de/bozen2000/ proceedings/contents.html [79] Olson, C.F. (1995), Parallel Algorithm for Hierarchical Clustering, Parallel computing, 21, 1313-1325. http://faculty.washington.edu/cfolson/papers.html [80] Pattison, P.E. (1988), Network Models; Some Comments on Papers in this Special Issue, Social Networks, 10, 383 411. [81] Pattison, P.E. (1993). Algebraic Models for Social Networks. Cambridge, England: Cambridge University Press. [82] Roberts, F.S. (1976), Discrete mathematical models with applications to social, biological, and environmental problems. Englewood Cliffs, N.J.: Prentice Hall. [83] Schreider, Ju.A. (1975), Equality, resemblance, and order, Moskva: Mir. [84] Scott, J. (2000), Social Network Analysis: A Handbook, 2nd edition. London: Sage Publications. [85] Seidman, S.B., (1983), Network Structure And Minimum Degree. Social Networks, 5, 269-287. [86] Shamos, M.I. (1976), Geometry and statistics: Problems at the interface, in J. Traub (Ed.): Algorithms and Complexity (New directions and recent results). New York: Academic Press, 251-288. [87] Sneath P.H.A., Sokal R.R. (1973), Numerical taxonomy. W.H. Freeman and Company, San Francisco. [88] Snyder, D., Kick, E. (1979), Structural position in the world system and economic growth 1955-70: A multiple network analysis of transnational interactions. American Journal of Sociology 84, 1096-1126. [89] Späth H. (1977), Cluster Analyse Algorithmen zur Objekt-Klassifizierung und Datenreduction. R. Oldenbourg, München. [90] Trinajstić, N. (1983), Chemical graph theory, Vol. 2. CRC Press, Boca Raton. 7

[91] Van Cutsem, B., ed. (1994), Classification and Dissimilarity Analysis, Lecture Notes in Statistics, New York: Springer Verlag. [92] Ward, J.H. (1963), Hierarchical grouping to optimize an objective function, American Statistical Association Journal, 58, 236-244. [93] Wasserman, S. & Faust, K. (1994). Social Network Analysis: Methods and Applications. Cambridge: Cambridge University Press. [94] White, D.R. and K.P. Reitz (1983), Graph and semigroup homomorphisms on networks of relations. Social Networks, 5:193 234. [95] Index of clustering papers on the WWW: http://www.ece.nwu.edu/ harsha/clustering/clus.html 8