n-level Graph Partitioning

Size: px
Start display at page:

Download "n-level Graph Partitioning"

Transcription

1 Vitaly Osipov, Peter Sanders - Algorithmik II 1 Vitaly Osipov: KIT Universität des Landes Baden-Württemberg und nationales Grossforschungszentrum in der Helmholtz-Gemeinschaft Institut für Theoretische Informatik

2 Overview Introduction ngp KasPar Contraction Local Search Experimental Evaluation Future Work 2 Vitaly Osipov:

3 Introduction Graph Partitioning G = (V, E, c, ω) ω : E R >0 c : V R 0, n = V, m = E. V 1 V k = V, s.t. V i V j = for i = j c(v i ) L max = (1 + ɛ)c(v )/k + max v V c(v) minimize cut i<j w(e ij ) E ij = {{u, v} E u V i, v V j } 3 Vitaly Osipov:

4 Introduction Multilevel Approaches Edge Contraction {u, v} x ω({x, z}) = ω({u, z}) + ω({v, z}). c(x) = c(u) + c(v) u v z z x contraction phase match contract input graph local improvement initial partitioning uncontract output partition refinement phase 4 Vitaly Osipov:

5 ngp n-gp(g,k,ɛ) begin if G is small then return initialpartition(g,k,ɛ) end pick the edge e = {u, v} with the highest rating contract(e) P :=n-gp(g,k,ɛ) activate(u) activate(v) localsearch() return P end 5 Vitaly Osipov:

6 Karlsrue Sequential Partitioner Contraction contract a single edge between two levels possibly n levels finegrained contraction consequtive levels are very similar no matching algorithm required use of different edge ratings uniform distribution of node weights priority queue defines the order of edges to be contracted Local Search efficent stopping creteria avoid quadratic runtime General Trial Trees improve quality by independent trials 6 Vitaly Osipov:

7 Contraction edge rating - expansion ({u, v}) = ω({u,v}) c(u)c(v) addressable priority queue based on pairing heaps dynamic graph data structure memory overhead Vitaly Osipov: m

8 Local Search Nodes unmarked, active, marked active nodes compute gains of moving from one block to another choose the maximum gain when moved active marked can t become active anymore unmarked neighbours of marked active gains in each step identically distributed, independent random variables expectation µ variance σ 2 compute m and σ 2 from previous steps stop after p steps if pµ 2 > ασ 2 + β 8 Vitaly Osipov: u v v u

9 Local Search Does it help? local search steps / n k=64 k=32 k=16 k=8 k=4 k= n without stoping criteria less than 1% improvement orders of magnitute slower 9 Vitaly Osipov:

10 Experimental evaluation Platform and Settings System one core of Intel Xeon Quad-core Processor featuring 2x4 MB of L2 cache and clocked at GHz of a 2 processor Intel Xeon X5355 node 16 GB of RAM gcc % imbalance 10 repetitions for the small networks 5 repetitions for the other averaging over multiple instances geometric mean 10 Vitaly Osipov:

11 Experimental evaluation two sets - small and large real world graphs Table: Geometric means (times, cut values) over all instances. code small graphs large graphs social networks best avg. t[s] best avg. t[s] best avg. t[s] KaSPar strong KaSPar fast KaSPar fast, α = KaPPa strong KaPPa fast kmetis Scotch On average 5.9% vs KaPPa strong (small instances) 8.1% vs KaPPa strong (large instances) 32% vs Metis (large instances) Repeating scotch as long as KasPar strong run and choosing the best result 12.1% larger cuts 11 Vitaly Osipov:

12 Experimental evaluation Walshaw s Benchmark 34 graphs for k {2, 4, 8, 16, 32, 64} and balance parameter ɛ {0, 0.01, 0.03, 0.05} 816 entries KasPar in one hour per entrie improved 155 values 42 for 1% 49 for 3% 64 for 5% if counting only large graphs (>44K nodes) and ɛ > 0 63% of the entries reproduces equaly good cuts in 83 other entries 12 Vitaly Osipov:

13 Future work better initial partitioner for large k exploit shared memory parallelism 13 Vitaly Osipov:

14 Thank you! 14 Vitaly Osipov:

Nikolaj Leischner, Vitaly Osipov, Peter Sanders

Nikolaj Leischner, Vitaly Osipov, Peter Sanders Nikolaj Leichner, Vitaly Oipov, Peter Sander Intitut für Theoretiche Informatik - Algorithmik II Oipov: 1 KITVitaly Univerität de Lande Baden-Württemerg und nationale Groforchungzentrum in der Helmholtz-Gemeinchaft

More information

Monte Carlo neutron transport and thermal-hydraulic simulations using Serpent 2/SUBCHANFLOW

Monte Carlo neutron transport and thermal-hydraulic simulations using Serpent 2/SUBCHANFLOW Monte Carlo neutron transport and thermal-hydraulic simulations using Serpent 2/SUBCHANFLOW M. Knebel (Presented by V. Valtavirta) Institute for Neutron Physics and Reactor Technology (INR) Reactor Physics

More information

Dynamical Low-Rank Approximation to the Solution of Wave Equations

Dynamical Low-Rank Approximation to the Solution of Wave Equations Dynamical Low-Rank Approximation to the Solution of Wave Equations Julia Schweitzer joint work with Marlis Hochbruck INSTITUT FÜR ANGEWANDTE UND NUMERISCHE MATHEMATIK 1 KIT Universität des Landes Baden-Württemberg

More information

Demonstration of Full PWR Core Coupled Monte Carlo Neutron Transport and Thermal-Hydraulic Simulations Using Serpent 2/ SUBCHANFLOW

Demonstration of Full PWR Core Coupled Monte Carlo Neutron Transport and Thermal-Hydraulic Simulations Using Serpent 2/ SUBCHANFLOW Demonstration of Full PWR Core Coupled Monte Carlo Neutron Transport and Thermal-Hydraulic Simulations Using Serpent 2/ SUBCHANFLOW M. Daeubler Institute for Neutron Physics and Reactor Technology (INR)

More information

Parallelization of Multilevel Preconditioners Constructed from Inverse-Based ILUs on Shared-Memory Multiprocessors

Parallelization of Multilevel Preconditioners Constructed from Inverse-Based ILUs on Shared-Memory Multiprocessors Parallelization of Multilevel Preconditioners Constructed from Inverse-Based ILUs on Shared-Memory Multiprocessors J.I. Aliaga 1 M. Bollhöfer 2 A.F. Martín 1 E.S. Quintana-Ortí 1 1 Deparment of Computer

More information

Messung des differentiellen Wirkungsquerschnitts der Z-Boson Produktion im Elektron-Zerfallskanal mit dem CMS-Detektor bei s=8 TeV.

Messung des differentiellen Wirkungsquerschnitts der Z-Boson Produktion im Elektron-Zerfallskanal mit dem CMS-Detektor bei s=8 TeV. Messung des differentiellen Wirkungsquerschnitts der Z-Boson Produktion im Elektron-Zerfallskanal mit dem CMS-Detektor bei s=8 TeV. DPG Frühjahrstagung 2015 Dominik Haitz, Klaus Rabbertz, Günter Quast

More information

J.I. Aliaga 1 M. Bollhöfer 2 A.F. Martín 1 E.S. Quintana-Ortí 1. March, 2009

J.I. Aliaga 1 M. Bollhöfer 2 A.F. Martín 1 E.S. Quintana-Ortí 1. March, 2009 Parallel Preconditioning of Linear Systems based on ILUPACK for Multithreaded Architectures J.I. Aliaga M. Bollhöfer 2 A.F. Martín E.S. Quintana-Ortí Deparment of Computer Science and Engineering, Univ.

More information

Incomplete Cholesky preconditioners that exploit the low-rank property

Incomplete Cholesky preconditioners that exploit the low-rank property anapov@ulb.ac.be ; http://homepages.ulb.ac.be/ anapov/ 1 / 35 Incomplete Cholesky preconditioners that exploit the low-rank property (theory and practice) Artem Napov Service de Métrologie Nucléaire, Université

More information

2016 SM H ττ Analysis 26th September 2016

2016 SM H ττ Analysis 26th September 2016 2016 SM H ττ Analysis 26th September 2016 INSTITUT FÜR EXPERIMENTELLE KERNPHYSIK (EKP) FAKULTÄT FÜR PHYSIK KIT Universität des Landes Baden Württemberg und nationales Forschungszentrum in der Helmholtz

More information

Minimizing Clock Latency Range in Robust Clock Tree Synthesis

Minimizing Clock Latency Range in Robust Clock Tree Synthesis Minimizing Clock Latency Range in Robust Clock Tree Synthesis Wen-Hao Liu Yih-Lang Li Hui-Chi Chen You have to enlarge your font. Many pages are hard to view. I think the position of Page topic is too

More information

SPARSE SOLVERS POISSON EQUATION. Margreet Nool. November 9, 2015 FOR THE. CWI, Multiscale Dynamics

SPARSE SOLVERS POISSON EQUATION. Margreet Nool. November 9, 2015 FOR THE. CWI, Multiscale Dynamics SPARSE SOLVERS FOR THE POISSON EQUATION Margreet Nool CWI, Multiscale Dynamics November 9, 2015 OUTLINE OF THIS TALK 1 FISHPACK, LAPACK, PARDISO 2 SYSTEM OVERVIEW OF CARTESIUS 3 POISSON EQUATION 4 SOLVERS

More information

Multilevel low-rank approximation preconditioners Yousef Saad Department of Computer Science and Engineering University of Minnesota

Multilevel low-rank approximation preconditioners Yousef Saad Department of Computer Science and Engineering University of Minnesota Multilevel low-rank approximation preconditioners Yousef Saad Department of Computer Science and Engineering University of Minnesota SIAM CSE Boston - March 1, 2013 First: Joint work with Ruipeng Li Work

More information

StreamSVM Linear SVMs and Logistic Regression When Data Does Not Fit In Memory

StreamSVM Linear SVMs and Logistic Regression When Data Does Not Fit In Memory StreamSVM Linear SVMs and Logistic Regression When Data Does Not Fit In Memory S.V. N. (vishy) Vishwanathan Purdue University and Microsoft vishy@purdue.edu October 9, 2012 S.V. N. Vishwanathan (Purdue,

More information

Higgs Boson Physics. Analysis Techniques. Günter Quast, Roger Wolf, Andrew Gilbert Master-Kurs SS

Higgs Boson Physics. Analysis Techniques. Günter Quast, Roger Wolf, Andrew Gilbert Master-Kurs SS Higgs Boson Physics Analysis Techniques Günter Quast, Roger Wolf, Andrew Gilbert Master-Kurs SS 2015 Institut für Experimentelle Kernphysik KIT Universität des Landes Baden-Württemberg und nationales Forschungszentrum

More information

Introduction. Marcel Radermacher Algorithmen zur Visualisierung von Graphen

Introduction. Marcel Radermacher Algorithmen zur Visualisierung von Graphen Algorithms for Graph Visualization INSTITUT FÜR THEORETISCHE INFORMATIK FAKULTÄT FÜR INFORMATIK Marcel Radermacher 30.11.2017 1 Introduction Before: always based on some properties: tree, series-parallel

More information

From CityGML to Energy ADE - Workflow

From CityGML to Energy ADE - Workflow From CityGML to Energy ADE - Workflow KIT Universität von Baden-Württemberg und nationales Forschungszentrum der Helmholtz Gemeinschaft www.kit.edu Workflow overview CityGML import Geometric / semantic

More information

Analytical Modeling of Parallel Programs (Chapter 5) Alexandre David

Analytical Modeling of Parallel Programs (Chapter 5) Alexandre David Analytical Modeling of Parallel Programs (Chapter 5) Alexandre David 1.2.05 1 Topic Overview Sources of overhead in parallel programs. Performance metrics for parallel systems. Effect of granularity on

More information

Mixed Integer Programming:

Mixed Integer Programming: Mixed Integer Programming: Analyzing 12 Years of Progress Roland Wunderling CPLEX Optimizer Architect Background 2001: Manfred Padberg s60 th birthday Bixby et al., Mixed-Integer Programming: A Progress

More information

Highly-scalable branch and bound for maximum monomial agreement

Highly-scalable branch and bound for maximum monomial agreement Highly-scalable branch and bound for maximum monomial agreement Jonathan Eckstein (Rutgers) William Hart Cynthia A. Phillips Sandia National Laboratories Sandia National Laboratories is a multi-program

More information

CSCI Final Project Report A Parallel Implementation of Viterbi s Decoding Algorithm

CSCI Final Project Report A Parallel Implementation of Viterbi s Decoding Algorithm CSCI 1760 - Final Project Report A Parallel Implementation of Viterbi s Decoding Algorithm Shay Mozes Brown University shay@cs.brown.edu Abstract. This report describes parallel Java implementations of

More information

Algorithms for Graph Visualization Force-Directed Algorithms

Algorithms for Graph Visualization Force-Directed Algorithms Algorithms for Graph Visualization INSTITUT FÜR THEORETISCHE INFORMATIK FAKULTÄT FÜR INFORMATIK Tamara Mchedlidze 21.12.2016 1 Introduction Before: always based on some properties: tree, series-parallel

More information

Oeding (Auburn) tensors of rank 5 December 15, / 24

Oeding (Auburn) tensors of rank 5 December 15, / 24 Oeding (Auburn) 2 2 2 2 2 tensors of rank 5 December 15, 2015 1 / 24 Recall Peter Burgisser s overview lecture (Jan Draisma s SIAM News article). Big Goal: Bound the computational complexity of det n,

More information

Massively parallel semi-lagrangian solution of the 6d Vlasov-Poisson problem

Massively parallel semi-lagrangian solution of the 6d Vlasov-Poisson problem Massively parallel semi-lagrangian solution of the 6d Vlasov-Poisson problem Katharina Kormann 1 Klaus Reuter 2 Markus Rampp 2 Eric Sonnendrücker 1 1 Max Planck Institut für Plasmaphysik 2 Max Planck Computing

More information

A Quantum Chemistry Domain-Specific Language for Heterogeneous Clusters

A Quantum Chemistry Domain-Specific Language for Heterogeneous Clusters A Quantum Chemistry Domain-Specific Language for Heterogeneous Clusters ANTONINO TUMEO, ORESTE VILLA Collaborators: Karol Kowalski, Sriram Krishnamoorthy, Wenjing Ma, Simone Secchi May 15, 2012 1 Outline!

More information

Composition-Climate Interactions: The recent past

Composition-Climate Interactions: The recent past Composition-Climate Interactions: The recent past Peter Braesicke I Fügen Sie auf der Masterfolie ein frei wählbares Bild ein (z.b. passend zum Vortrag) KIT Universität des Landes Baden-Württemberg und

More information

Technische Universität Dresden Institute of Numerical Mathematics

Technische Universität Dresden Institute of Numerical Mathematics Technische Universität Dresden Institute of Numerical Mathematics An Improved Flow-based Formulation and Reduction Principles for the Minimum Connectivity Inference Problem Muhammad Abid Dar Andreas Fischer

More information

UTPlaceF 3.0: A Parallelization Framework for Modern FPGA Global Placement

UTPlaceF 3.0: A Parallelization Framework for Modern FPGA Global Placement UTPlaceF 3.0: A Parallelization Framework for Modern FPGA Global Placement Wuxi Li, Meng Li, Jiajun Wang, and David Z. Pan University of Texas at Austin wuxili@utexas.edu November 14, 2017 UT DA Wuxi Li

More information

Analytical Modeling of Parallel Systems

Analytical Modeling of Parallel Systems Analytical Modeling of Parallel Systems Chieh-Sen (Jason) Huang Department of Applied Mathematics National Sun Yat-sen University Thank Ananth Grama, Anshul Gupta, George Karypis, and Vipin Kumar for providing

More information

UV-LED Module Design with Maximum Power Density

UV-LED Module Design with Maximum Power Density UV-LED Module Design with Maximum Power Density Manfred Scholdt 1, Christian Herbold 1, Marc Schneider 2, Cornelius Neumann 1 1 e 2 Institute for Data Processing and Electronics e (LTI) KIT Universität

More information

Models: Amdahl s Law, PRAM, α-β Tal Ben-Nun

Models: Amdahl s Law, PRAM, α-β Tal Ben-Nun spcl.inf.ethz.ch @spcl_eth Models: Amdahl s Law, PRAM, α-β Tal Ben-Nun Design of Parallel and High-Performance Computing Fall 2017 DPHPC Overview cache coherency memory models 2 Speedup An application

More information

Population Estimation: Using High-Performance Computing in Statistical Research. Craig Finch Zia Rehman

Population Estimation: Using High-Performance Computing in Statistical Research. Craig Finch Zia Rehman Population Estimation: Using High-Performance Computing in Statistical Research Craig Finch Zia Rehman Statistical estimation Estimated Value Confidence Interval Actual Value Estimator: a rule for finding

More information

CSE 4502/5717 Big Data Analytics Spring 2018; Homework 1 Solutions

CSE 4502/5717 Big Data Analytics Spring 2018; Homework 1 Solutions CSE 502/5717 Big Data Analytics Spring 2018; Homework 1 Solutions 1. Consider the following algorithm: for i := 1 to α n log e n do Pick a random j [1, n]; If a[j] = a[j + 1] or a[j] = a[j 1] then output:

More information

Scalable String and Suffix Sorting: Algorithms, Techniques, and Tools

Scalable String and Suffix Sorting: Algorithms, Techniques, and Tools Scalable String and Suffix Sorting: Algorithms, Techniques, and Tools Timo Bingmann Dissertation Defense July 3rd, 28 INSTITUTE OF THEORETICAL INFORMATICS ALGORITHMICS KIT The Research University in the

More information

Parallel PIPS-SBB Multi-level parallelism for 2-stage SMIPS. Lluís-Miquel Munguia, Geoffrey M. Oxberry, Deepak Rajan, Yuji Shinano

Parallel PIPS-SBB Multi-level parallelism for 2-stage SMIPS. Lluís-Miquel Munguia, Geoffrey M. Oxberry, Deepak Rajan, Yuji Shinano Parallel PIPS-SBB Multi-level parallelism for 2-stage SMIPS Lluís-Miquel Munguia, Geoffrey M. Oxberry, Deepak Rajan, Yuji Shinano ... Our contribution PIPS-PSBB*: Multi-level parallelism for Stochastic

More information

Characterizing Quantum Supremacy in Near-Term Devices

Characterizing Quantum Supremacy in Near-Term Devices Characterizing Quantum Supremacy in Near-Term Devices S. Boixo S. Isakov, V. Smelyanskiy, R. Babbush, M. Smelyanskiy, N. Ding, Z. Jiang, M. J. Bremner, J. Martinis, H. Neven Google January 19th Beyond-classical

More information

A Newton-Galerkin-ADI Method for Large-Scale Algebraic Riccati Equations

A Newton-Galerkin-ADI Method for Large-Scale Algebraic Riccati Equations A Newton-Galerkin-ADI Method for Large-Scale Algebraic Riccati Equations Peter Benner Max-Planck-Institute for Dynamics of Complex Technical Systems Computational Methods in Systems and Control Theory

More information

Sparse solver 64 bit and out-of-core addition

Sparse solver 64 bit and out-of-core addition Sparse solver 64 bit and out-of-core addition Prepared By: Richard Link Brian Yuen Martec Limited 1888 Brunswick Street, Suite 400 Halifax, Nova Scotia B3J 3J8 PWGSC Contract Number: W7707-145679 Contract

More information

Environment (Parallelizing Query Optimization)

Environment (Parallelizing Query Optimization) Advanced d Query Optimization i i Techniques in a Parallel Computing Environment (Parallelizing Query Optimization) Wook-Shin Han*, Wooseong Kwak, Jinsoo Lee Guy M. Lohman, Volker Markl Kyungpook National

More information

PetaBricks: Variable Accuracy and Online Learning

PetaBricks: Variable Accuracy and Online Learning PetaBricks: Variable Accuracy and Online Learning Jason Ansel MIT - CSAIL May 4, 2011 Jason Ansel (MIT) PetaBricks May 4, 2011 1 / 40 Outline 1 Motivating Example 2 PetaBricks Language Overview 3 Variable

More information

QR Decomposition in a Multicore Environment

QR Decomposition in a Multicore Environment QR Decomposition in a Multicore Environment Omar Ahsan University of Maryland-College Park Advised by Professor Howard Elman College Park, MD oha@cs.umd.edu ABSTRACT In this study we examine performance

More information

Machine learning in Higgs analyses

Machine learning in Higgs analyses Machine learning in Higgs analyses Higgs Couplings 2017 Marco A. Harrendorf on behalf of the CMS collaboration October 9th, 2017 INSTITUT FÜR EXPERIMENTELLE TEILCHENPHYSIK (ETP) KIT Universität des Landes

More information

Where is SUSY? Institut für Experimentelle Kernphysik

Where is SUSY?   Institut für Experimentelle Kernphysik Where is SUSY? Institut ür Experimentelle Kernphysik KIT Universität des Landes Baden-Württemberg und nationales Forschungszentrum in der Helmholtz-Gemeinschat www.kit.edu I supersymmetric particles exist,

More information

On the design of parallel linear solvers for large scale problems

On the design of parallel linear solvers for large scale problems On the design of parallel linear solvers for large scale problems ICIAM - August 2015 - Mini-Symposium on Recent advances in matrix computations for extreme-scale computers M. Faverge, X. Lacoste, G. Pichon,

More information

iretilp : An efficient incremental algorithm for min-period retiming under general delay model

iretilp : An efficient incremental algorithm for min-period retiming under general delay model iretilp : An efficient incremental algorithm for min-period retiming under general delay model Debasish Das, Jia Wang and Hai Zhou EECS, Northwestern University, Evanston, IL 60201 Place and Route Group,

More information

Preparatory Material for the European Intensive Program in Bydgoszcz 2011 Analytical and computer assisted methods in mathematical models

Preparatory Material for the European Intensive Program in Bydgoszcz 2011 Analytical and computer assisted methods in mathematical models Preparatory Material for the European Intensive Program in Bydgoszcz 2011 Analytical and computer assisted methods in mathematical models September 4{18 Basics on the Lebesgue integral and the divergence

More information

Multi-Approximate-Keyword Routing Query

Multi-Approximate-Keyword Routing Query Bin Yao 1, Mingwang Tang 2, Feifei Li 2 1 Department of Computer Science and Engineering Shanghai Jiao Tong University, P. R. China 2 School of Computing University of Utah, USA Outline 1 Introduction

More information

A Tight Lower Bound for Dynamic Membership in the External Memory Model

A Tight Lower Bound for Dynamic Membership in the External Memory Model A Tight Lower Bound for Dynamic Membership in the External Memory Model Elad Verbin ITCS, Tsinghua University Qin Zhang Hong Kong University of Science & Technology April 2010 1-1 The computational model

More information

Cache-Oblivious Computations: Algorithms and Experimental Evaluation

Cache-Oblivious Computations: Algorithms and Experimental Evaluation Cache-Oblivious Computations: Algorithms and Experimental Evaluation Vijaya Ramachandran Department of Computer Sciences University of Texas at Austin Dissertation work of former PhD student Dr. Rezaul

More information

Leveraging Task-Parallelism in Energy-Efficient ILU Preconditioners

Leveraging Task-Parallelism in Energy-Efficient ILU Preconditioners Leveraging Task-Parallelism in Energy-Efficient ILU Preconditioners José I. Aliaga Leveraging task-parallelism in energy-efficient ILU preconditioners Universidad Jaime I (Castellón, Spain) José I. Aliaga

More information

Low-ranksemidefiniteprogrammingforthe. MAX2SAT problem. Bosch Center for Artificial Intelligence

Low-ranksemidefiniteprogrammingforthe. MAX2SAT problem. Bosch Center for Artificial Intelligence Low-ranksemidefiniteprogrammingforthe MAX2SAT problem Po-Wei Wang J. Zico Kolter Machine Learning Department Carnegie Mellon University School of Computer Science, Carnegie Mellon University, and Bosch

More information

Algorithm Theory - Exercise Class

Algorithm Theory - Exercise Class Algorithm Theory - Exercise Class Exercise Lesson 3 Albert-Ludwigs-Universität Freiburg Philipp Schneider Algorithms and Complexity - Professor Dr. Fabian Kuhn Organizational Matters English Tutorial in

More information

The Finite Cell Method: High order simulation of complex structures without meshing

The Finite Cell Method: High order simulation of complex structures without meshing The Finite Cell Method: High order simulation of complex structures without meshing E. Rank, A. Düster, D. Schillinger, Z. Yang Fakultät für Bauingenieur und Vermessungswesen Technische Universität München,

More information

Drawing Large Graphs by Multilevel Maxent-Stress Optimization

Drawing Large Graphs by Multilevel Maxent-Stress Optimization Drawing Large Graphs by Multilevel Maxent-Stress Optimization Henning Meyerhenke, Martin Nöllenburg, Christian Schulz Annweiler, TOR 2016 (ursprünglich präsentiert auf der GD 2015), KIT 1 H. Meyerhenke

More information

arxiv: v1 [cs.dm] 19 Jan 2019

arxiv: v1 [cs.dm] 19 Jan 2019 Computing Optimal Tangles Faster arxiv:1901.06548v1 [cs.dm] 19 Jan 2019 Oksana Firman Institut für Informatik, Universität Würzburg, Germany firstname.lastname@uni-wuerzburg.de Philipp Kindermann Institut

More information

Prioritized Garbage Collection Using the Garbage Collector to Support Caching

Prioritized Garbage Collection Using the Garbage Collector to Support Caching Prioritized Garbage Collection Using the Garbage Collector to Support Caching Diogenes Nunez, Samuel Z. Guyer, Emery D. Berger Tufts University, University of Massachusetts Amherst November 2, 2016 D.

More information

Discrete Wiskunde II. Lecture 5: Shortest Paths & Spanning Trees

Discrete Wiskunde II. Lecture 5: Shortest Paths & Spanning Trees , 2009 Lecture 5: Shortest Paths & Spanning Trees University of Twente m.uetz@utwente.nl wwwhome.math.utwente.nl/~uetzm/dw/ Shortest Path Problem "#$%&'%()*%"()$#+,&- Given directed "#$%&'()*+,%+('-*.#/'01234564'.*,'7+"-%/8',&'5"4'84%#3

More information

Embedded Systems - FS 2018

Embedded Systems - FS 2018 Institut für Technische Informatik und Kommunikationsnetze Prof. L. Thiele Embedded Systems - FS 2018 Sample solution to Exercise 3 Discussion Date: 11.4.2018 Aperiodic Scheduling Task 1: Earliest Deadline

More information

Parallel Transposition of Sparse Data Structures

Parallel Transposition of Sparse Data Structures Parallel Transposition of Sparse Data Structures Hao Wang, Weifeng Liu, Kaixi Hou, Wu-chun Feng Department of Computer Science, Virginia Tech Niels Bohr Institute, University of Copenhagen Scientific Computing

More information

A Practical Parallel Algorithm for Diameter Approximation of Massive Weighted Graphs

A Practical Parallel Algorithm for Diameter Approximation of Massive Weighted Graphs A Practical Parallel Algorithm for Diameter Approximation of Massive Weighted Graphs Matteo Ceccarello Joint work with Andrea Pietracaprina, Geppino Pucci, and Eli Upfal Università di Padova Brown University

More information

DES. 4. Petri Nets. Introduction. Different Classes of Petri Net. Petri net properties. Analysis of Petri net models

DES. 4. Petri Nets. Introduction. Different Classes of Petri Net. Petri net properties. Analysis of Petri net models 4. Petri Nets Introduction Different Classes of Petri Net Petri net properties Analysis of Petri net models 1 Petri Nets C.A Petri, TU Darmstadt, 1962 A mathematical and graphical modeling method. Describe

More information

On the design of parallel linear solvers for large scale problems

On the design of parallel linear solvers for large scale problems On the design of parallel linear solvers for large scale problems Journée problème de Poisson, IHP, Paris M. Faverge, P. Ramet M. Faverge Assistant Professor Bordeaux INP LaBRI Inria Bordeaux - Sud-Ouest

More information

Performance comparison between hybridizable DG and classical DG methods for elastic waves simulation in harmonic domain

Performance comparison between hybridizable DG and classical DG methods for elastic waves simulation in harmonic domain March 4-5, 2015 Performance comparison between hybridizable DG and classical DG methods for elastic waves simulation in harmonic domain M. Bonnasse-Gahot 1,2, H. Calandra 3, J. Diaz 1 and S. Lanteri 2

More information

Models, Data, Learning Problems

Models, Data, Learning Problems Universität Potsdam Institut für Informatik Lehrstuhl Maschinelles Lernen Models, Data, Learning Problems Tobias Scheffer Overview Types of learning problems: Supervised Learning (Classification, Regression,

More information

Topic 1: Atmosphere and Climate

Topic 1: Atmosphere and Climate Topic 1: Atmosphere and Climate Peter Braesicke Fügen Sie auf der Masterfolie ein frei wählbares Bild ein (z.b. passend zum Vortrag) KIT Universität des Landes Baden-Württemberg und nationales Forschungszentrum

More information

Computing least squares condition numbers on hybrid multicore/gpu systems

Computing least squares condition numbers on hybrid multicore/gpu systems Computing least squares condition numbers on hybrid multicore/gpu systems M. Baboulin and J. Dongarra and R. Lacroix Abstract This paper presents an efficient computation for least squares conditioning

More information

Parallelism in Structured Newton Computations

Parallelism in Structured Newton Computations Parallelism in Structured Newton Computations Thomas F Coleman and Wei u Department of Combinatorics and Optimization University of Waterloo Waterloo, Ontario, Canada N2L 3G1 E-mail: tfcoleman@uwaterlooca

More information

On the Computational Complexity of the Discrete Pascal Transform

On the Computational Complexity of the Discrete Pascal Transform 6 th International Conference Logic and Applications LAP 207, September 8-22, 207, Dubrovnik, Croatia On the Computational Complexity of the Discrete Pascal Transform Dušan B. Gajić, Radomir S. Stanković

More information

Performance Metrics for Computer Systems. CASS 2018 Lavanya Ramapantulu

Performance Metrics for Computer Systems. CASS 2018 Lavanya Ramapantulu Performance Metrics for Computer Systems CASS 2018 Lavanya Ramapantulu Eight Great Ideas in Computer Architecture Design for Moore s Law Use abstraction to simplify design Make the common case fast Performance

More information

Patrick Kirchgaeßer 07. Januar 2016

Patrick Kirchgaeßer 07. Januar 2016 Patrick Kirchgaeßer 07. Januar 2016 INSTITUTE OF EXPERIMENTAL PARTICLE PHYSICS (IEKP) PHYSICS FACULTY KIT Universität des Landes Baden-Württemberg und nationales Forschungszentrum in der Helmholtz-Gemeinschaft

More information

ICS 252 Introduction to Computer Design

ICS 252 Introduction to Computer Design ICS 252 fall 2006 Eli Bozorgzadeh Computer Science Department-UCI References and Copyright Textbooks referred [Mic94] G. De Micheli Synthesis and Optimization of Digital Circuits McGraw-Hill, 1994. [CLR90]

More information

T U M. Performing Permuting on a Multiprocessor Architecture Using Packet Communication I N S T I T U T F Ü R I N F O R M A T I K

T U M. Performing Permuting on a Multiprocessor Architecture Using Packet Communication I N S T I T U T F Ü R I N F O R M A T I K T U M I S T I T U T F Ü R I F O R M A T I K Performing Permuting on a Multiprocessor Architecture Using Packet Communication Riko Jacob Michael Schnupp ABCDE FGHIJ KLMO TUM-I0817 Juni 08 T E C H I S C

More information

Module 5: CPU Scheduling

Module 5: CPU Scheduling Module 5: CPU Scheduling Basic Concepts Scheduling Criteria Scheduling Algorithms Multiple-Processor Scheduling Real-Time Scheduling Algorithm Evaluation 5.1 Basic Concepts Maximum CPU utilization obtained

More information

Analysis of Algorithms [Reading: CLRS 2.2, 3] Laura Toma, csci2200, Bowdoin College

Analysis of Algorithms [Reading: CLRS 2.2, 3] Laura Toma, csci2200, Bowdoin College Analysis of Algorithms [Reading: CLRS 2.2, 3] Laura Toma, csci2200, Bowdoin College Why analysis? We want to predict how the algorithm will behave (e.g. running time) on arbitrary inputs, and how it will

More information

An Optimized Interestingness Hotspot Discovery Framework for Large Gridded Spatio-temporal Datasets

An Optimized Interestingness Hotspot Discovery Framework for Large Gridded Spatio-temporal Datasets IEEE Big Data 2015 Big Data in Geosciences Workshop An Optimized Interestingness Hotspot Discovery Framework for Large Gridded Spatio-temporal Datasets Fatih Akdag and Christoph F. Eick Department of Computer

More information

Balanced Truncation Model Reduction of Large and Sparse Generalized Linear Systems

Balanced Truncation Model Reduction of Large and Sparse Generalized Linear Systems Balanced Truncation Model Reduction of Large and Sparse Generalized Linear Systems Jos M. Badía 1, Peter Benner 2, Rafael Mayo 1, Enrique S. Quintana-Ortí 1, Gregorio Quintana-Ortí 1, A. Remón 1 1 Depto.

More information

Chapter 6: CPU Scheduling

Chapter 6: CPU Scheduling Chapter 6: CPU Scheduling Basic Concepts Scheduling Criteria Scheduling Algorithms Multiple-Processor Scheduling Real-Time Scheduling Algorithm Evaluation 6.1 Basic Concepts Maximum CPU utilization obtained

More information

Multitasking Polynomial Homotopy Continuation in PHCpack. Jan Verschelde

Multitasking Polynomial Homotopy Continuation in PHCpack. Jan Verschelde Multitasking Polynomial Homotopy Continuation in PHCpack Jan Verschelde University of Illinois at Chicago Department of Mathematics, Statistics, and Computer Science http://www.math.uic.edu/ jan jan@math.uic.edu

More information

Discrete Optimization 2010 Lecture 3 Maximum Flows

Discrete Optimization 2010 Lecture 3 Maximum Flows Remainder: Shortest Paths Maximum Flows Discrete Optimization 2010 Lecture 3 Maximum Flows Marc Uetz University of Twente m.uetz@utwente.nl Lecture 3: sheet 1 / 29 Marc Uetz Discrete Optimization Outline

More information

MPI at MPI. Jens Saak. Max Planck Institute for Dynamics of Complex Technical Systems Computational Methods in Systems and Control Theory

MPI at MPI. Jens Saak. Max Planck Institute for Dynamics of Complex Technical Systems Computational Methods in Systems and Control Theory MAX PLANCK INSTITUTE November 5, 2010 MPI at MPI Jens Saak Max Planck Institute for Dynamics of Complex Technical Systems Computational Methods in Systems and Control Theory FOR DYNAMICS OF COMPLEX TECHNICAL

More information

Static-scheduling and hybrid-programming in SuperLU DIST on multicore cluster systems

Static-scheduling and hybrid-programming in SuperLU DIST on multicore cluster systems Static-scheduling and hybrid-programming in SuperLU DIST on multicore cluster systems Ichitaro Yamazaki University of Tennessee, Knoxville Xiaoye Sherry Li Lawrence Berkeley National Laboratory MS49: Sparse

More information

Novel Heuristics for Coalition Structure Generation in Multi-agent Systems

Novel Heuristics for Coalition Structure Generation in Multi-agent Systems , June 3 - July 2, 21, London, U.K. Novel Heuristics for Coalition Structure Generation in Multi-agent Systems Mustansar Ali Ghazanfar and Adam Prugel-Bennett Abstract A coalition is a set of self-interested

More information

Jacobi-Based Eigenvalue Solver on GPU. Lung-Sheng Chien, NVIDIA

Jacobi-Based Eigenvalue Solver on GPU. Lung-Sheng Chien, NVIDIA Jacobi-Based Eigenvalue Solver on GPU Lung-Sheng Chien, NVIDIA lchien@nvidia.com Outline Symmetric eigenvalue solver Experiment Applications Conclusions Symmetric eigenvalue solver The standard form is

More information

Scalable Software for Multivariate Integration on Hybrid Platforms

Scalable Software for Multivariate Integration on Hybrid Platforms Journal of Physics: Conference Series PAPER OPEN ACCESS Scalable Software for Multivariate Integration on Hybrid Platforms To cite this article: E de Doncker et al 2015 J. Phys.: Conf. Ser. 640 012062

More information

FINE-GRAINED PARALLEL INCOMPLETE LU FACTORIZATION

FINE-GRAINED PARALLEL INCOMPLETE LU FACTORIZATION FINE-GRAINED PARALLEL INCOMPLETE LU FACTORIZATION EDMOND CHOW AND AFTAB PATEL Abstract. This paper presents a new fine-grained parallel algorithm for computing an incomplete LU factorization. All nonzeros

More information

Measurement of Jet Energy Scale and Resolution at ATLAS and CMS at s = 8 TeV

Measurement of Jet Energy Scale and Resolution at ATLAS and CMS at s = 8 TeV Measurement of Jet Energy Scale and Resolution at ATLAS and CMS at s = 8 TeV EDSBlois 2015 02.07.2015 Dominik Haitz on behalf of the ATLAS and CMS Collaborations INSTITUT FÜR EXPERIMENTELLE KERNPHYSIK

More information

FINE-GRAINED PARALLEL INCOMPLETE LU FACTORIZATION

FINE-GRAINED PARALLEL INCOMPLETE LU FACTORIZATION FINE-GRAINED PARALLEL INCOMPLETE LU FACTORIZATION EDMOND CHOW AND AFTAB PATEL Abstract. This paper presents a new fine-grained parallel algorithm for computing an incomplete LU factorization. All nonzeros

More information

GPU acceleration of Newton s method for large systems of polynomial equations in double double and quad double arithmetic

GPU acceleration of Newton s method for large systems of polynomial equations in double double and quad double arithmetic GPU acceleration of Newton s method for large systems of polynomial equations in double double and quad double arithmetic Jan Verschelde joint work with Xiangcheng Yu University of Illinois at Chicago

More information

A Better Scoring Model for De Novo Peptide Sequencing: The Symmetric Difference between Explained and Measured Masses Supplementary Figures

A Better Scoring Model for De Novo Peptide Sequencing: The Symmetric Difference between Explained and Measured Masses Supplementary Figures A Better Scoring Model for De Novo Peptide Sequencing: The Symmetric Difference between Explained and Measured Masses Supplementary Figures Thomas Tschager *, Simon Rösch *, Ludovic Gillet 2 and Peter

More information

Atmospheric Water Vapor Effect on GNSS Signals and InSAR Data

Atmospheric Water Vapor Effect on GNSS Signals and InSAR Data Atmospheric Water Vapor Effect on GNSS Signals and InSAR Data Basic Concept and Preliminary Results F. Alshawaf 1, S. Hinz 1, A. Thiele 1 T. Fuhrmann 2, B. Heck 2, A. Knöpfler 2, X. Luo 2, M. Mayer 2,

More information

Advanced Data Structures

Advanced Data Structures Simon Gog gog@kit.edu - 0 Simon Gog: KIT University of the State of Baden-Wuerttemberg and National Research Center of the Helmholtz Association www.kit.edu Dynamic Perfect Hashing What we want: O(1) lookup

More information

A parallel exponential integrator for large-scale discretizations of advection-diffusion models

A parallel exponential integrator for large-scale discretizations of advection-diffusion models A parallel exponential integrator for large-scale discretizations of advection-diffusion models L. Bergamaschi 1, M. Caliari 2, A. Martínez 3, and M. Vianello 3 1 Department of Mathematical Methods and

More information

Some Geometric and Algebraic Aspects of Domain Decomposition Methods

Some Geometric and Algebraic Aspects of Domain Decomposition Methods Some Geometric and Algebraic Aspects of Domain Decomposition Methods D.S.Butyugin 1, Y.L.Gurieva 1, V.P.Ilin 1,2, and D.V.Perevozkin 1 Abstract Some geometric and algebraic aspects of various domain decomposition

More information

The Number of Spanning Trees in a Graph

The Number of Spanning Trees in a Graph Course Trees The Ubiquitous Structure in Computer Science and Mathematics, JASS 08 The Number of Spanning Trees in a Graph Konstantin Pieper Fakultät für Mathematik TU München April 28, 2008 Konstantin

More information

arxiv: v1 [math.co] 3 Feb 2014

arxiv: v1 [math.co] 3 Feb 2014 Enumeration of nonisomorphic Hamiltonian cycles on square grid graphs arxiv:1402.0545v1 [math.co] 3 Feb 2014 Abstract Ed Wynn 175 Edmund Road, Sheffield S2 4EG, U.K. The enumeration of Hamiltonian cycles

More information

Ozone_CCI Contribution

Ozone_CCI Contribution Ozone_CCI Contribution Peter Braesicke (CRG) on behalf of the Ozone_CCI Team Science Lead: Michel Van Roozendael, BIRA Fügen Sie auf der Masterfolie ein frei wählbares Bild ein (z.b. passend zum Vortrag)

More information

c 2015 Society for Industrial and Applied Mathematics

c 2015 Society for Industrial and Applied Mathematics SIAM J. SCI. COMPUT. Vol. 37, No. 2, pp. C169 C193 c 2015 Society for Industrial and Applied Mathematics FINE-GRAINED PARALLEL INCOMPLETE LU FACTORIZATION EDMOND CHOW AND AFTAB PATEL Abstract. This paper

More information

CSL model checking of biochemical networks with Interval Decision Diagrams

CSL model checking of biochemical networks with Interval Decision Diagrams CSL model checking of biochemical networks with Interval Decision Diagrams Brandenburg University of Technology Cottbus Computer Science Department http://www-dssz.informatik.tu-cottbus.de/software/mc.html

More information

Dense Arithmetic over Finite Fields with CUMODP

Dense Arithmetic over Finite Fields with CUMODP Dense Arithmetic over Finite Fields with CUMODP Sardar Anisul Haque 1 Xin Li 2 Farnam Mansouri 1 Marc Moreno Maza 1 Wei Pan 3 Ning Xie 1 1 University of Western Ontario, Canada 2 Universidad Carlos III,

More information

Trust-Region SQP Methods with Inexact Linear System Solves for Large-Scale Optimization

Trust-Region SQP Methods with Inexact Linear System Solves for Large-Scale Optimization Trust-Region SQP Methods with Inexact Linear System Solves for Large-Scale Optimization Denis Ridzal Department of Computational and Applied Mathematics Rice University, Houston, Texas dridzal@caam.rice.edu

More information

On the Performance of SC-MMSE-FD Equalization for Fixed-Point Implementations

On the Performance of SC-MMSE-FD Equalization for Fixed-Point Implementations On the Performance of SC-MMSE-FD Equalization for Fixed-Point Implementations ISTC 2014, Bremen, Tamal Bose and Friedrich K. Jondral KIT Universität des Landes Baden-Württemberg und nationales Forschungszentrum

More information