Scalable Algorithms for Complex Networks

Similar documents
arxiv: v1 [cs.sy] 3 Mar 2011

Complex Mission Planning in Uncertain Adversarial Environment

STA141C: Big Data & High Performance Statistical Computing

Hyperbolic Polynomial Chaos Expansion (HPCE) and its Application to Statistical Analysis of Nonlinear Circuits

Lecture: Local Spectral Methods (1 of 4)

Certifying the Global Optimality of Graph Cuts via Semidefinite Programming: A Theoretic Guarantee for Spectral Clustering

A Non-Intrusive Polynomial Chaos Method For Uncertainty Propagation in CFD Simulations

Graphs in Machine Learning

Spectral Clustering. Guokun Lai 2016/10

Algorithms for Uncertainty Quantification

Preface. 2 Linear Equations and Eigenvalue Problem 22

Numerical Mathematics

Spectral Graph Theory

Estimating functional uncertainty using polynomial chaos and adjoint equations

Spectra of Adjacency and Laplacian Matrices

A Polynomial Chaos Approach to Robust Multiobjective Optimization

Spectral Graph Theory and its Applications. Daniel A. Spielman Dept. of Computer Science Program in Applied Mathematics Yale Unviersity

Locally-biased analytics

Index. C 2 ( ), 447 C k [a,b], 37 C0 ( ), 618 ( ), 447 CD 2 CN 2

Applications of Randomized Methods for Decomposing and Simulating from Large Covariance Matrices

Final Report: DE-FG02-95ER25239 Spectral Representations of Uncertainty: Algorithms and Applications

Stochastic Collocation Methods for Polynomial Chaos: Analysis and Applications

Uncertainty Quantification and hypocoercivity based sensitivity analysis for multiscale kinetic equations with random inputs.

Hamiltonian System Approach to Distributed Spectral Decomposition in Networks

Non-Convex Optimization in Machine Learning. Jan Mrkos AIC

Uncertainty Quantification in MEMS

Uncertainty Quantification for multiscale kinetic equations with random inputs. Shi Jin. University of Wisconsin-Madison, USA

Networks and Their Spectra

Inverse Power Method for Non-linear Eigenproblems

Gradient Sampling for Improved Action Selection and Path Synthesis

Stochastic Spectral Approaches to Bayesian Inference

CPSC 540: Machine Learning

Markov Chains and Spectral Clustering

VCMC: Variational Consensus Monte Carlo

Semiclassical computational methods for quantum dynamics with bandcrossings. Shi Jin University of Wisconsin-Madison

An Efficient Graph Sparsification Approach to Scalable Harmonic Balance (HB) Analysis of Strongly Nonlinear RF Circuits

Complex Social System, Elections. Introduction to Network Analysis 1

1 Matrix notation and preliminaries from spectral graph theory

Graph Partitioning Using Random Walks

Power Grid Partitioning: Static and Dynamic Approaches

Data Analysis and Manifold Learning Lecture 7: Spectral Clustering

Multi-Robotic Systems

Safety Envelope for Load Tolerance and Its Application to Fatigue Reliability Design

Combining multiple surrogate models to accelerate failure probability estimation with expensive high-fidelity models

Adiabatic quantum computation a tutorial for computer scientists

Machine Learning for Data Science (CS4786) Lecture 11

Nonlinear seismic imaging via reduced order model backprojection

ECS231: Spectral Partitioning. Based on Berkeley s CS267 lecture on graph partition

Laplacian Agent Learning: Representation Policy Iteration

Review for Exam 2 Ben Wang and Mark Styczynski

1 Matrix notation and preliminaries from spectral graph theory

Stochastic structural dynamic analysis with random damping parameters

FORMATIONS OF FORMATIONS: HIERARCHY AND STABILITY

Uncertainty Quantification in Performance Evaluation of Manufacturing Processes

Stable Adaptive Co-simulation: A Switched Systems Approach. Cláudio Gomes, Benoît Legat, Raphaël M. Jungers, Hans Vangheluwe

Index. higher order methods, 52 nonlinear, 36 with variable coefficients, 34 Burgers equation, 234 BVP, see boundary value problems

Analysis of Spectral Kernel Design based Semi-supervised Learning

Uncertainty Quantification for multiscale kinetic equations with high dimensional random inputs with sparse grids

Spectral Clustering. Spectral Clustering? Two Moons Data. Spectral Clustering Algorithm: Bipartioning. Spectral methods

Balancing and Control of a Freely-Swinging Pendulum Using a Model-Free Reinforcement Learning Algorithm

Uncertainty Quantification in Computational Science

Spectral Clustering on Handwritten Digits Database

Random Vibrations & Failure Analysis Sayan Gupta Indian Institute of Technology Madras

Numerical Schemes from the Perspective of Consensus

Faster Kinetics: Accelerate Your Finite-Rate Combustion Simulation with GPUs

Numerical Methods for Engineers and Scientists

Introduction to Uncertainty Quantification in Computational Science Handout #3

March 13, Paper: R.R. Coifman, S. Lafon, Diffusion maps ([Coifman06]) Seminar: Learning with Graphs, Prof. Hein, Saarland University

4 Film Extension of the Dynamics: Slowness as Stability

3.320 Lecture 23 (5/3/05)

Statistical Perspectives on Geographic Information Science. Michael F. Goodchild University of California Santa Barbara

EFFICIENT SHAPE OPTIMIZATION USING POLYNOMIAL CHAOS EXPANSION AND LOCAL SENSITIVITIES

Spectral Techniques for Clustering

SOLVING POWER AND OPTIMAL POWER FLOW PROBLEMS IN THE PRESENCE OF UNCERTAINTY BY AFFINE ARITHMETIC

Applied Numerical Analysis

Diffusion and random walks on graphs

Dinesh Kumar, Mehrdad Raisee and Chris Lacor

Discretization of SDEs: Euler Methods and Beyond

Numerical Methods with MATLAB

Distributed Adaptive Consensus Protocol with Decaying Gains on Directed Graphs

Chapter 1, Section 1.2, Example 9 (page 13) and Exercise 29 (page 15). Use the Uniqueness Tool. Select the option ẋ = x

Preliminary Examination, Numerical Analysis, August 2016

Suggestions for Making Useful the Uncertainty Quantification Results from CFD Applications

The RAMSES code and related techniques 4. Source terms

NUMERICAL METHODS FOR ENGINEERING APPLICATION

MBS/FEM Co-Simulation Approach for Analyzing Fluid/Structure- Interaction Phenomena in Turbine Systems

Bayesian Methods for Machine Learning

Chaospy: A modular implementation of Polynomial Chaos expansions and Monte Carlo methods

Enabling Advanced Automation Tools to manage Trajectory Prediction Uncertainty

Decentralized Estimation of Laplacian Eigenvalues in Multi-Agent Systems

Numerical Analysis for Statisticians

Convex Optimization of Graph Laplacian Eigenvalues

Introduction to Computational Stochastic Differential Equations

Lecture V: The game-engine loop & Time Integration

Communities, Spectral Clustering, and Random Walks

A Distributed Newton Method for Network Utility Maximization, II: Convergence

Analysis and synthesis: a complexity perspective

Approximate Inference

Global Analysis of Piecewise Linear Systems Using Impact Maps and Surface Lyapunov Functions

Distributed Distance-Bounded Network Design Through Distributed Convex Programming

Transcription:

Scalable Algorithms for Complex Networks Tuhin Sahai United Technologies Research Center University of Connecticut

Outline / Acknowledgements Decentralized Clustering Alberto Speranzon (UTRC) Andrzej Banaszuk (UTRC) Distributed Computation Stefan Klus (U. Paderborn/UTRC) Michael Dellnitz (U. Paderborn) Uncertainty Quantification José Miguel Pasini (UTRC) Amit Surana (UTRC) Andrzej Banaszuk (UTRC) Vladimir Fonoberov (Aimdyn Inc.) Sophie Loire (UC Santa Barbara) 2

Complex Networks: Building Systems 40% of produced energy in the US is consumed by buildings Thermal models T amb T osur R win T isur T zone Energy Efficient Integrated Buildings Large Sensor Networks CFD, Reduced Order Models Q surfo 1/h o A C Analysis & Control R C 1/h i A Q structure Q surfi Large ODE Models Distributed Computation Uncertainty Quantification Monitoring, Security, Occupancy Estimation etc. Distributed Estimation Distributed Optimization Need Scalable & Decentralized Solutions 3

Complex Networks: Smart Grids & Aircraft Systems Renewable energy sources: DOE s 2030 goal 20% of all energy to have renewable sources Aircraft Electrical Systems No Central Control Decentralized Control Heterogeneous Switching systems Stability Analysis FAA Certification Uncertainty Quantification 4

Complex Networks: UAV Swarms/Sensor Nets Networks of mobile sensors needed for various tasks Path planning for search & rescue [Mathew and Mezic, 09] Centralized: Changing Environment? UAV Swarm Sample Prior Distribution Sensor Coverage [Ghrist and Jadbabaie] Slow: Random Walk/ Consensus Novel algorithms needed for deployment in applications Mobile sensors need to self-organize using local information Role of uncertainty quantification for random parameters 5

Complex Networks: Social Graphs Analysis of large social networks: time-varying and multi- attributed interactions GUARD-DOG Program Social Network Parallel/Cloud Computation Community detection Games on social networks Scalable algorithms Other important complex networks: MEMS oscillator networks, biological systems etc. 6

Scalable Solutions for Complex Networks Divide and Conquer - help build scalable solutions for various problems Partition Graph/Data Implement distributed algorithm Desired: Decentralized, scalable and fast solutions for clustering, computation, estimation, uncertainty quantification etc. Clustering enables scalability 7

Shape from Harmonic Analysis Can one hear the shape of a drum? an article by Mark Kac: do frequencies of vibrations determine the shape uniquely? Sparked decades of mathematical activity! Answer: Yes (if convexity assumed) Similarly we ask: Can one hear the properties of a graph (such as cluster locations)? Answer: Yes (if no symmetries) Non-convex drums with same frequencies M. Kac, Can One Hear the Shape of a Drum, American Mathematical Monthly, 1966 8

Graph Analysis Matrix approach for representing graphs: Laplacian has nice properties Eigenvalues: Eigenvectors: Eigenvalues: Connected components, diameter etc Spectral Gap: Number of clusters Eigenvectors: Clustering, Localization, Sensor Coverage, PageRank etc F. Chung, Spectral Graph Theory, 1997 9

Spectral Clustering (Centralized) Clusters: set of nodes strongly connected to each other Partition graphs based on: Strength of connection NP-Complete where, Size of clusters Relax problem Rayleigh-Ritz U. v. Luxburg, A Tutorial on Spectral Clustering, Statistics and Computing, 17(4), 2007 10

Spectral Clustering (Centralized) Eigenvector of Laplacian holds the clustering information.. + - + - + + It is extended to more than 2 clusters by using higher eigenvectors (combinations of signs) Very popular approach Drawbacks: Solution is expensive for large graphs Central computer needed M. Fiedler, Czechoslovak Mathematical Journal, 1975. 11

Distributed Spectral Clustering Orthogonal Iterations on a graph Random Walk on the graph Random walks can be slow equivalent to evolving the heat equation on the graph. Convergence: : Measure of time for random walk to get to steady state D. Kempe and F. McSherry, A Decentralized Algorithm for Spectral Analysis, 2008. 12

Insight Traditional distributed clustering uses random walks Heat Equation (random walks): On the graph: Wave equation: On the graph: The solution of the heat equation dies out, but the wave equation does not M. Hein, J.-Y. Audibert and U. V. Luxburg, From Graphs to Manifolds - Weak and Strong Pointwise Consistency of Graph Laplacians, 2005. T. Sahai, A. Speranzon and A. Banaszuk, Hearing the Clusters in a Graph: A Distributed Algorithm, Automatica, 2012. 13

Example of the Algorithm + + - - + - + - + + - - + + C1 C2 C3 C4 C1 C3 C2 Local FFT 14

Main Result Evolving the wave equation combined with local frequency estimation is equivalent to computing the eigenvectors of the graph Laplacian 15

Proof Outline Need to start with a random initial condition 16

Performance vs. Random Walk It can be shown that: Time needed to resolve the lowest frequency in the FFT Compare to [Kempe & McSherry, 2008] Orders of magnitude faster than random walks is a function of 17

Comments about the Wave Equation Computes all eigenvectors and eigenvalues of the Laplacian Useful for distributed graph analysis and self-organizing networks Algorithm is decentralized (no central node required) Fast Convergence (faster than random-walk based methods) Only scalar quantities exchanged (low communication cost) 18

Results: Line Graph 19

Applications & Results Social Network & Data Analysis Community detection in MapReduce (DARPA) Algorithm to detect disconnected components and isolated nodes Scalable analysis of large quantities of sensor data Fortunato Benchmark Example Distributed Estimation Contaminant Flow in Buildings Decentralized Path Planning Sensors self-organize into groups Accelerates distributed estimation by orders of magnitude Extends work by Mathew and Mezic 2009 Mobile agents self-organize into groups Decentralized computation of trajectories T. Sahai, A. Speranzon and A. Banaszuk, Hearing the Clusters in a Graph: A Distributed Algorithm 20

Distributed Computation Waveform Relaxation: Algorithm for distributed simulation of differential equations on parallel computers Split Variables in CPU #1 Variables in CPU #2 Goal: Simulate for the interval Compute Iterate J. K. White and A. Sangiovanni-Vincentelli, Relaxation Techniques for the Simulation of VLSI Circuits, 1986. 21

Distributed Computation Waveform Relaxation always converges as long as the original function is Lipschitz continuous S. Klus, T. Sahai, C. Liu and M. Dellnitz (2011), An Efficient Algorithm for the Parallel Solution of High-Dimensional Differential Equations, Journal of Computational and Applied Mathematics.. 22

Adaptive Waveform Relaxation Split into parts Reduce by reducing when estimate for is large Take large steps when function changes slowly and small steps when function changes fast Waveform relaxation can be represented as: Adaptive Waveform Relaxation Start with an initial time interval of say: Compute the solution using waveform relaxation Let be the number of desired iterations and the desired tolerance Estimate the next time interval: Where is estimated using a standard extrapolation formula 23

Optimal Splitting for AWR Optimal splitting changes with the numerical scheme and step size: NP complete Spectral Clustering is a good heuristic for splitting (symmetrized Jacobian) Optimal for Implicit Euler: Implicit Euler: Optimal for Implicit Euler: Trapezoidal Rule: 24

Thermal Model of a Building Energy consumption in Buildings Equations for Rooms & Walls Newton s Law of Cooling Conduction Solar HVAC Appliances/People 25

Thermal Model of a Building Energy consumption in Buildings Models T amb T osur R win T isur T zone 1/h o A R 1/h i A Q structure Q surfo C C Q surfi 1.4 ODEs 1.2 1 23 Clusters 0.8 Clustering 0.6 0.4 0.2 0 1 floor gives 140 states -0.2 0 10 20 30 40 50 60 70 AWR 10 times faster than WR AWR & WR 26

Propagating Uncertainty through Complex Networks Given input (parametric) uncertainty, quantify the output uncertainty: Monte Carlo Quasi-Monte Carlo Response Surface Methods Polynomial Chaos/Probabilistic Collocation Methods Combination of the above T. Sahai, V. Fonoberov and S. Loire (2010), Uncertainty as a stabilizer of the head-tail ordered phase in carbon-monoxide monolayers on graphite, Physical Review B. 27

Polynomial Chaos Starting with the system: : distribution on Expand the state variables (or outputs) as: Orthogonal Polynomials Performing a Galerkin Projection gives: Determining determines Exponential convergence rate for processes with finite variance. Curse of dimensionality: X. Wan and G. E. Karniadakis (2008), Recent Advances in Polynomial Chaos Methods and Extensions 28

UQ for Hybrid Systems H-PC + Wavelet expansions + Boundary Layers help deal with hybrid dynamical systems curse of dimensionality H-PC: Hybrid dynamics : Uncertain Wavelets Bouncing Ball Monte Carlo Boundary Layer T. Sahai and J. M. Pasini, Uncertainty Quantification in Hybrid Dynamical Systems, 2013 29

Scalable Uncertainty Quantification Use decentralized clustering to partition the dynamical system into weakly interacting components Weak interactions lower order expansions Random Parameters A. Surana, T. Sahai and A. Banaszuk (2012), Iterative Methods for Scalable Uncertainty Quantification in Complex Networks, International Journal of Uncertainty Quantification. 30

Results: Scalable UQ Consider a two room model with uncertain parameters R win T amb T osur T isur T zone 1/h o A R 1/h i A Q structure Q surfo C C Q surfi Random parameters Gaussian distributions 2 rooms give a 10 state model Clustering + AWR 60 state model 31

Conclusions/Future Work Motivated from continuous approaches one can construct efficient approaches for NP-hard problems such as graph partitioning Scalability is enabled by graph decomposition Simulating high-dimensional dynamical systems Polynomial chaos based uncertainty quantification in complex networks Future Directions include: Distributed Computation: Index 2 DAEs, Equation-Free Methods Clustering of time varying multi-attributed graphs for distributed computation, sensor coverage and localization Machine Learning for Big-Data problems 32

Thank You! 33