n-level Graph Partitioning
|
|
- Ashlie Phoebe Bennett
- 5 years ago
- Views:
Transcription
1 Vitaly Osipov, Peter Sanders - Algorithmik II 1 Vitaly Osipov: KIT Universität des Landes Baden-Württemberg und nationales Grossforschungszentrum in der Helmholtz-Gemeinschaft Institut für Theoretische Informatik
2 Overview Introduction ngp KasPar Contraction Local Search Experimental Evaluation Future Work 2 Vitaly Osipov:
3 Introduction Graph Partitioning G = (V, E, c, ω) ω : E R >0 c : V R 0, n = V, m = E. V 1 V k = V, s.t. V i V j = for i = j c(v i ) L max = (1 + ɛ)c(v )/k + max v V c(v) minimize cut i<j w(e ij ) E ij = {{u, v} E u V i, v V j } 3 Vitaly Osipov:
4 Introduction Multilevel Approaches Edge Contraction {u, v} x ω({x, z}) = ω({u, z}) + ω({v, z}). c(x) = c(u) + c(v) u v z z x contraction phase match contract input graph local improvement initial partitioning uncontract output partition refinement phase 4 Vitaly Osipov:
5 ngp n-gp(g,k,ɛ) begin if G is small then return initialpartition(g,k,ɛ) end pick the edge e = {u, v} with the highest rating contract(e) P :=n-gp(g,k,ɛ) activate(u) activate(v) localsearch() return P end 5 Vitaly Osipov:
6 Karlsrue Sequential Partitioner Contraction contract a single edge between two levels possibly n levels finegrained contraction consequtive levels are very similar no matching algorithm required use of different edge ratings uniform distribution of node weights priority queue defines the order of edges to be contracted Local Search efficent stopping creteria avoid quadratic runtime General Trial Trees improve quality by independent trials 6 Vitaly Osipov:
7 Contraction edge rating - expansion ({u, v}) = ω({u,v}) c(u)c(v) addressable priority queue based on pairing heaps dynamic graph data structure memory overhead Vitaly Osipov: m
8 Local Search Nodes unmarked, active, marked active nodes compute gains of moving from one block to another choose the maximum gain when moved active marked can t become active anymore unmarked neighbours of marked active gains in each step identically distributed, independent random variables expectation µ variance σ 2 compute m and σ 2 from previous steps stop after p steps if pµ 2 > ασ 2 + β 8 Vitaly Osipov: u v v u
9 Local Search Does it help? local search steps / n k=64 k=32 k=16 k=8 k=4 k= n without stoping criteria less than 1% improvement orders of magnitute slower 9 Vitaly Osipov:
10 Experimental evaluation Platform and Settings System one core of Intel Xeon Quad-core Processor featuring 2x4 MB of L2 cache and clocked at GHz of a 2 processor Intel Xeon X5355 node 16 GB of RAM gcc % imbalance 10 repetitions for the small networks 5 repetitions for the other averaging over multiple instances geometric mean 10 Vitaly Osipov:
11 Experimental evaluation two sets - small and large real world graphs Table: Geometric means (times, cut values) over all instances. code small graphs large graphs social networks best avg. t[s] best avg. t[s] best avg. t[s] KaSPar strong KaSPar fast KaSPar fast, α = KaPPa strong KaPPa fast kmetis Scotch On average 5.9% vs KaPPa strong (small instances) 8.1% vs KaPPa strong (large instances) 32% vs Metis (large instances) Repeating scotch as long as KasPar strong run and choosing the best result 12.1% larger cuts 11 Vitaly Osipov:
12 Experimental evaluation Walshaw s Benchmark 34 graphs for k {2, 4, 8, 16, 32, 64} and balance parameter ɛ {0, 0.01, 0.03, 0.05} 816 entries KasPar in one hour per entrie improved 155 values 42 for 1% 49 for 3% 64 for 5% if counting only large graphs (>44K nodes) and ɛ > 0 63% of the entries reproduces equaly good cuts in 83 other entries 12 Vitaly Osipov:
13 Future work better initial partitioner for large k exploit shared memory parallelism 13 Vitaly Osipov:
14 Thank you! 14 Vitaly Osipov:
Nikolaj Leischner, Vitaly Osipov, Peter Sanders
Nikolaj Leichner, Vitaly Oipov, Peter Sander Intitut für Theoretiche Informatik - Algorithmik II Oipov: 1 KITVitaly Univerität de Lande Baden-Württemerg und nationale Groforchungzentrum in der Helmholtz-Gemeinchaft
More informationMonte Carlo neutron transport and thermal-hydraulic simulations using Serpent 2/SUBCHANFLOW
Monte Carlo neutron transport and thermal-hydraulic simulations using Serpent 2/SUBCHANFLOW M. Knebel (Presented by V. Valtavirta) Institute for Neutron Physics and Reactor Technology (INR) Reactor Physics
More informationDynamical Low-Rank Approximation to the Solution of Wave Equations
Dynamical Low-Rank Approximation to the Solution of Wave Equations Julia Schweitzer joint work with Marlis Hochbruck INSTITUT FÜR ANGEWANDTE UND NUMERISCHE MATHEMATIK 1 KIT Universität des Landes Baden-Württemberg
More informationDemonstration of Full PWR Core Coupled Monte Carlo Neutron Transport and Thermal-Hydraulic Simulations Using Serpent 2/ SUBCHANFLOW
Demonstration of Full PWR Core Coupled Monte Carlo Neutron Transport and Thermal-Hydraulic Simulations Using Serpent 2/ SUBCHANFLOW M. Daeubler Institute for Neutron Physics and Reactor Technology (INR)
More informationParallelization of Multilevel Preconditioners Constructed from Inverse-Based ILUs on Shared-Memory Multiprocessors
Parallelization of Multilevel Preconditioners Constructed from Inverse-Based ILUs on Shared-Memory Multiprocessors J.I. Aliaga 1 M. Bollhöfer 2 A.F. Martín 1 E.S. Quintana-Ortí 1 1 Deparment of Computer
More informationMessung des differentiellen Wirkungsquerschnitts der Z-Boson Produktion im Elektron-Zerfallskanal mit dem CMS-Detektor bei s=8 TeV.
Messung des differentiellen Wirkungsquerschnitts der Z-Boson Produktion im Elektron-Zerfallskanal mit dem CMS-Detektor bei s=8 TeV. DPG Frühjahrstagung 2015 Dominik Haitz, Klaus Rabbertz, Günter Quast
More informationJ.I. Aliaga 1 M. Bollhöfer 2 A.F. Martín 1 E.S. Quintana-Ortí 1. March, 2009
Parallel Preconditioning of Linear Systems based on ILUPACK for Multithreaded Architectures J.I. Aliaga M. Bollhöfer 2 A.F. Martín E.S. Quintana-Ortí Deparment of Computer Science and Engineering, Univ.
More informationIncomplete Cholesky preconditioners that exploit the low-rank property
anapov@ulb.ac.be ; http://homepages.ulb.ac.be/ anapov/ 1 / 35 Incomplete Cholesky preconditioners that exploit the low-rank property (theory and practice) Artem Napov Service de Métrologie Nucléaire, Université
More information2016 SM H ττ Analysis 26th September 2016
2016 SM H ττ Analysis 26th September 2016 INSTITUT FÜR EXPERIMENTELLE KERNPHYSIK (EKP) FAKULTÄT FÜR PHYSIK KIT Universität des Landes Baden Württemberg und nationales Forschungszentrum in der Helmholtz
More informationMinimizing Clock Latency Range in Robust Clock Tree Synthesis
Minimizing Clock Latency Range in Robust Clock Tree Synthesis Wen-Hao Liu Yih-Lang Li Hui-Chi Chen You have to enlarge your font. Many pages are hard to view. I think the position of Page topic is too
More informationSPARSE SOLVERS POISSON EQUATION. Margreet Nool. November 9, 2015 FOR THE. CWI, Multiscale Dynamics
SPARSE SOLVERS FOR THE POISSON EQUATION Margreet Nool CWI, Multiscale Dynamics November 9, 2015 OUTLINE OF THIS TALK 1 FISHPACK, LAPACK, PARDISO 2 SYSTEM OVERVIEW OF CARTESIUS 3 POISSON EQUATION 4 SOLVERS
More informationMultilevel low-rank approximation preconditioners Yousef Saad Department of Computer Science and Engineering University of Minnesota
Multilevel low-rank approximation preconditioners Yousef Saad Department of Computer Science and Engineering University of Minnesota SIAM CSE Boston - March 1, 2013 First: Joint work with Ruipeng Li Work
More informationStreamSVM Linear SVMs and Logistic Regression When Data Does Not Fit In Memory
StreamSVM Linear SVMs and Logistic Regression When Data Does Not Fit In Memory S.V. N. (vishy) Vishwanathan Purdue University and Microsoft vishy@purdue.edu October 9, 2012 S.V. N. Vishwanathan (Purdue,
More informationHiggs Boson Physics. Analysis Techniques. Günter Quast, Roger Wolf, Andrew Gilbert Master-Kurs SS
Higgs Boson Physics Analysis Techniques Günter Quast, Roger Wolf, Andrew Gilbert Master-Kurs SS 2015 Institut für Experimentelle Kernphysik KIT Universität des Landes Baden-Württemberg und nationales Forschungszentrum
More informationIntroduction. Marcel Radermacher Algorithmen zur Visualisierung von Graphen
Algorithms for Graph Visualization INSTITUT FÜR THEORETISCHE INFORMATIK FAKULTÄT FÜR INFORMATIK Marcel Radermacher 30.11.2017 1 Introduction Before: always based on some properties: tree, series-parallel
More informationFrom CityGML to Energy ADE - Workflow
From CityGML to Energy ADE - Workflow KIT Universität von Baden-Württemberg und nationales Forschungszentrum der Helmholtz Gemeinschaft www.kit.edu Workflow overview CityGML import Geometric / semantic
More informationAnalytical Modeling of Parallel Programs (Chapter 5) Alexandre David
Analytical Modeling of Parallel Programs (Chapter 5) Alexandre David 1.2.05 1 Topic Overview Sources of overhead in parallel programs. Performance metrics for parallel systems. Effect of granularity on
More informationMixed Integer Programming:
Mixed Integer Programming: Analyzing 12 Years of Progress Roland Wunderling CPLEX Optimizer Architect Background 2001: Manfred Padberg s60 th birthday Bixby et al., Mixed-Integer Programming: A Progress
More informationHighly-scalable branch and bound for maximum monomial agreement
Highly-scalable branch and bound for maximum monomial agreement Jonathan Eckstein (Rutgers) William Hart Cynthia A. Phillips Sandia National Laboratories Sandia National Laboratories is a multi-program
More informationCSCI Final Project Report A Parallel Implementation of Viterbi s Decoding Algorithm
CSCI 1760 - Final Project Report A Parallel Implementation of Viterbi s Decoding Algorithm Shay Mozes Brown University shay@cs.brown.edu Abstract. This report describes parallel Java implementations of
More informationAlgorithms for Graph Visualization Force-Directed Algorithms
Algorithms for Graph Visualization INSTITUT FÜR THEORETISCHE INFORMATIK FAKULTÄT FÜR INFORMATIK Tamara Mchedlidze 21.12.2016 1 Introduction Before: always based on some properties: tree, series-parallel
More informationOeding (Auburn) tensors of rank 5 December 15, / 24
Oeding (Auburn) 2 2 2 2 2 tensors of rank 5 December 15, 2015 1 / 24 Recall Peter Burgisser s overview lecture (Jan Draisma s SIAM News article). Big Goal: Bound the computational complexity of det n,
More informationMassively parallel semi-lagrangian solution of the 6d Vlasov-Poisson problem
Massively parallel semi-lagrangian solution of the 6d Vlasov-Poisson problem Katharina Kormann 1 Klaus Reuter 2 Markus Rampp 2 Eric Sonnendrücker 1 1 Max Planck Institut für Plasmaphysik 2 Max Planck Computing
More informationA Quantum Chemistry Domain-Specific Language for Heterogeneous Clusters
A Quantum Chemistry Domain-Specific Language for Heterogeneous Clusters ANTONINO TUMEO, ORESTE VILLA Collaborators: Karol Kowalski, Sriram Krishnamoorthy, Wenjing Ma, Simone Secchi May 15, 2012 1 Outline!
More informationComposition-Climate Interactions: The recent past
Composition-Climate Interactions: The recent past Peter Braesicke I Fügen Sie auf der Masterfolie ein frei wählbares Bild ein (z.b. passend zum Vortrag) KIT Universität des Landes Baden-Württemberg und
More informationTechnische Universität Dresden Institute of Numerical Mathematics
Technische Universität Dresden Institute of Numerical Mathematics An Improved Flow-based Formulation and Reduction Principles for the Minimum Connectivity Inference Problem Muhammad Abid Dar Andreas Fischer
More informationUTPlaceF 3.0: A Parallelization Framework for Modern FPGA Global Placement
UTPlaceF 3.0: A Parallelization Framework for Modern FPGA Global Placement Wuxi Li, Meng Li, Jiajun Wang, and David Z. Pan University of Texas at Austin wuxili@utexas.edu November 14, 2017 UT DA Wuxi Li
More informationAnalytical Modeling of Parallel Systems
Analytical Modeling of Parallel Systems Chieh-Sen (Jason) Huang Department of Applied Mathematics National Sun Yat-sen University Thank Ananth Grama, Anshul Gupta, George Karypis, and Vipin Kumar for providing
More informationUV-LED Module Design with Maximum Power Density
UV-LED Module Design with Maximum Power Density Manfred Scholdt 1, Christian Herbold 1, Marc Schneider 2, Cornelius Neumann 1 1 e 2 Institute for Data Processing and Electronics e (LTI) KIT Universität
More informationModels: Amdahl s Law, PRAM, α-β Tal Ben-Nun
spcl.inf.ethz.ch @spcl_eth Models: Amdahl s Law, PRAM, α-β Tal Ben-Nun Design of Parallel and High-Performance Computing Fall 2017 DPHPC Overview cache coherency memory models 2 Speedup An application
More informationPopulation Estimation: Using High-Performance Computing in Statistical Research. Craig Finch Zia Rehman
Population Estimation: Using High-Performance Computing in Statistical Research Craig Finch Zia Rehman Statistical estimation Estimated Value Confidence Interval Actual Value Estimator: a rule for finding
More informationCSE 4502/5717 Big Data Analytics Spring 2018; Homework 1 Solutions
CSE 502/5717 Big Data Analytics Spring 2018; Homework 1 Solutions 1. Consider the following algorithm: for i := 1 to α n log e n do Pick a random j [1, n]; If a[j] = a[j + 1] or a[j] = a[j 1] then output:
More informationScalable String and Suffix Sorting: Algorithms, Techniques, and Tools
Scalable String and Suffix Sorting: Algorithms, Techniques, and Tools Timo Bingmann Dissertation Defense July 3rd, 28 INSTITUTE OF THEORETICAL INFORMATICS ALGORITHMICS KIT The Research University in the
More informationParallel PIPS-SBB Multi-level parallelism for 2-stage SMIPS. Lluís-Miquel Munguia, Geoffrey M. Oxberry, Deepak Rajan, Yuji Shinano
Parallel PIPS-SBB Multi-level parallelism for 2-stage SMIPS Lluís-Miquel Munguia, Geoffrey M. Oxberry, Deepak Rajan, Yuji Shinano ... Our contribution PIPS-PSBB*: Multi-level parallelism for Stochastic
More informationCharacterizing Quantum Supremacy in Near-Term Devices
Characterizing Quantum Supremacy in Near-Term Devices S. Boixo S. Isakov, V. Smelyanskiy, R. Babbush, M. Smelyanskiy, N. Ding, Z. Jiang, M. J. Bremner, J. Martinis, H. Neven Google January 19th Beyond-classical
More informationA Newton-Galerkin-ADI Method for Large-Scale Algebraic Riccati Equations
A Newton-Galerkin-ADI Method for Large-Scale Algebraic Riccati Equations Peter Benner Max-Planck-Institute for Dynamics of Complex Technical Systems Computational Methods in Systems and Control Theory
More informationSparse solver 64 bit and out-of-core addition
Sparse solver 64 bit and out-of-core addition Prepared By: Richard Link Brian Yuen Martec Limited 1888 Brunswick Street, Suite 400 Halifax, Nova Scotia B3J 3J8 PWGSC Contract Number: W7707-145679 Contract
More informationEnvironment (Parallelizing Query Optimization)
Advanced d Query Optimization i i Techniques in a Parallel Computing Environment (Parallelizing Query Optimization) Wook-Shin Han*, Wooseong Kwak, Jinsoo Lee Guy M. Lohman, Volker Markl Kyungpook National
More informationPetaBricks: Variable Accuracy and Online Learning
PetaBricks: Variable Accuracy and Online Learning Jason Ansel MIT - CSAIL May 4, 2011 Jason Ansel (MIT) PetaBricks May 4, 2011 1 / 40 Outline 1 Motivating Example 2 PetaBricks Language Overview 3 Variable
More informationQR Decomposition in a Multicore Environment
QR Decomposition in a Multicore Environment Omar Ahsan University of Maryland-College Park Advised by Professor Howard Elman College Park, MD oha@cs.umd.edu ABSTRACT In this study we examine performance
More informationMachine learning in Higgs analyses
Machine learning in Higgs analyses Higgs Couplings 2017 Marco A. Harrendorf on behalf of the CMS collaboration October 9th, 2017 INSTITUT FÜR EXPERIMENTELLE TEILCHENPHYSIK (ETP) KIT Universität des Landes
More informationWhere is SUSY? Institut für Experimentelle Kernphysik
Where is SUSY? Institut ür Experimentelle Kernphysik KIT Universität des Landes Baden-Württemberg und nationales Forschungszentrum in der Helmholtz-Gemeinschat www.kit.edu I supersymmetric particles exist,
More informationOn the design of parallel linear solvers for large scale problems
On the design of parallel linear solvers for large scale problems ICIAM - August 2015 - Mini-Symposium on Recent advances in matrix computations for extreme-scale computers M. Faverge, X. Lacoste, G. Pichon,
More informationiretilp : An efficient incremental algorithm for min-period retiming under general delay model
iretilp : An efficient incremental algorithm for min-period retiming under general delay model Debasish Das, Jia Wang and Hai Zhou EECS, Northwestern University, Evanston, IL 60201 Place and Route Group,
More informationPreparatory Material for the European Intensive Program in Bydgoszcz 2011 Analytical and computer assisted methods in mathematical models
Preparatory Material for the European Intensive Program in Bydgoszcz 2011 Analytical and computer assisted methods in mathematical models September 4{18 Basics on the Lebesgue integral and the divergence
More informationMulti-Approximate-Keyword Routing Query
Bin Yao 1, Mingwang Tang 2, Feifei Li 2 1 Department of Computer Science and Engineering Shanghai Jiao Tong University, P. R. China 2 School of Computing University of Utah, USA Outline 1 Introduction
More informationA Tight Lower Bound for Dynamic Membership in the External Memory Model
A Tight Lower Bound for Dynamic Membership in the External Memory Model Elad Verbin ITCS, Tsinghua University Qin Zhang Hong Kong University of Science & Technology April 2010 1-1 The computational model
More informationCache-Oblivious Computations: Algorithms and Experimental Evaluation
Cache-Oblivious Computations: Algorithms and Experimental Evaluation Vijaya Ramachandran Department of Computer Sciences University of Texas at Austin Dissertation work of former PhD student Dr. Rezaul
More informationLeveraging Task-Parallelism in Energy-Efficient ILU Preconditioners
Leveraging Task-Parallelism in Energy-Efficient ILU Preconditioners José I. Aliaga Leveraging task-parallelism in energy-efficient ILU preconditioners Universidad Jaime I (Castellón, Spain) José I. Aliaga
More informationLow-ranksemidefiniteprogrammingforthe. MAX2SAT problem. Bosch Center for Artificial Intelligence
Low-ranksemidefiniteprogrammingforthe MAX2SAT problem Po-Wei Wang J. Zico Kolter Machine Learning Department Carnegie Mellon University School of Computer Science, Carnegie Mellon University, and Bosch
More informationAlgorithm Theory - Exercise Class
Algorithm Theory - Exercise Class Exercise Lesson 3 Albert-Ludwigs-Universität Freiburg Philipp Schneider Algorithms and Complexity - Professor Dr. Fabian Kuhn Organizational Matters English Tutorial in
More informationThe Finite Cell Method: High order simulation of complex structures without meshing
The Finite Cell Method: High order simulation of complex structures without meshing E. Rank, A. Düster, D. Schillinger, Z. Yang Fakultät für Bauingenieur und Vermessungswesen Technische Universität München,
More informationDrawing Large Graphs by Multilevel Maxent-Stress Optimization
Drawing Large Graphs by Multilevel Maxent-Stress Optimization Henning Meyerhenke, Martin Nöllenburg, Christian Schulz Annweiler, TOR 2016 (ursprünglich präsentiert auf der GD 2015), KIT 1 H. Meyerhenke
More informationarxiv: v1 [cs.dm] 19 Jan 2019
Computing Optimal Tangles Faster arxiv:1901.06548v1 [cs.dm] 19 Jan 2019 Oksana Firman Institut für Informatik, Universität Würzburg, Germany firstname.lastname@uni-wuerzburg.de Philipp Kindermann Institut
More informationPrioritized Garbage Collection Using the Garbage Collector to Support Caching
Prioritized Garbage Collection Using the Garbage Collector to Support Caching Diogenes Nunez, Samuel Z. Guyer, Emery D. Berger Tufts University, University of Massachusetts Amherst November 2, 2016 D.
More informationDiscrete Wiskunde II. Lecture 5: Shortest Paths & Spanning Trees
, 2009 Lecture 5: Shortest Paths & Spanning Trees University of Twente m.uetz@utwente.nl wwwhome.math.utwente.nl/~uetzm/dw/ Shortest Path Problem "#$%&'%()*%"()$#+,&- Given directed "#$%&'()*+,%+('-*.#/'01234564'.*,'7+"-%/8',&'5"4'84%#3
More informationEmbedded Systems - FS 2018
Institut für Technische Informatik und Kommunikationsnetze Prof. L. Thiele Embedded Systems - FS 2018 Sample solution to Exercise 3 Discussion Date: 11.4.2018 Aperiodic Scheduling Task 1: Earliest Deadline
More informationParallel Transposition of Sparse Data Structures
Parallel Transposition of Sparse Data Structures Hao Wang, Weifeng Liu, Kaixi Hou, Wu-chun Feng Department of Computer Science, Virginia Tech Niels Bohr Institute, University of Copenhagen Scientific Computing
More informationA Practical Parallel Algorithm for Diameter Approximation of Massive Weighted Graphs
A Practical Parallel Algorithm for Diameter Approximation of Massive Weighted Graphs Matteo Ceccarello Joint work with Andrea Pietracaprina, Geppino Pucci, and Eli Upfal Università di Padova Brown University
More informationDES. 4. Petri Nets. Introduction. Different Classes of Petri Net. Petri net properties. Analysis of Petri net models
4. Petri Nets Introduction Different Classes of Petri Net Petri net properties Analysis of Petri net models 1 Petri Nets C.A Petri, TU Darmstadt, 1962 A mathematical and graphical modeling method. Describe
More informationOn the design of parallel linear solvers for large scale problems
On the design of parallel linear solvers for large scale problems Journée problème de Poisson, IHP, Paris M. Faverge, P. Ramet M. Faverge Assistant Professor Bordeaux INP LaBRI Inria Bordeaux - Sud-Ouest
More informationPerformance comparison between hybridizable DG and classical DG methods for elastic waves simulation in harmonic domain
March 4-5, 2015 Performance comparison between hybridizable DG and classical DG methods for elastic waves simulation in harmonic domain M. Bonnasse-Gahot 1,2, H. Calandra 3, J. Diaz 1 and S. Lanteri 2
More informationModels, Data, Learning Problems
Universität Potsdam Institut für Informatik Lehrstuhl Maschinelles Lernen Models, Data, Learning Problems Tobias Scheffer Overview Types of learning problems: Supervised Learning (Classification, Regression,
More informationTopic 1: Atmosphere and Climate
Topic 1: Atmosphere and Climate Peter Braesicke Fügen Sie auf der Masterfolie ein frei wählbares Bild ein (z.b. passend zum Vortrag) KIT Universität des Landes Baden-Württemberg und nationales Forschungszentrum
More informationComputing least squares condition numbers on hybrid multicore/gpu systems
Computing least squares condition numbers on hybrid multicore/gpu systems M. Baboulin and J. Dongarra and R. Lacroix Abstract This paper presents an efficient computation for least squares conditioning
More informationParallelism in Structured Newton Computations
Parallelism in Structured Newton Computations Thomas F Coleman and Wei u Department of Combinatorics and Optimization University of Waterloo Waterloo, Ontario, Canada N2L 3G1 E-mail: tfcoleman@uwaterlooca
More informationOn the Computational Complexity of the Discrete Pascal Transform
6 th International Conference Logic and Applications LAP 207, September 8-22, 207, Dubrovnik, Croatia On the Computational Complexity of the Discrete Pascal Transform Dušan B. Gajić, Radomir S. Stanković
More informationPerformance Metrics for Computer Systems. CASS 2018 Lavanya Ramapantulu
Performance Metrics for Computer Systems CASS 2018 Lavanya Ramapantulu Eight Great Ideas in Computer Architecture Design for Moore s Law Use abstraction to simplify design Make the common case fast Performance
More informationPatrick Kirchgaeßer 07. Januar 2016
Patrick Kirchgaeßer 07. Januar 2016 INSTITUTE OF EXPERIMENTAL PARTICLE PHYSICS (IEKP) PHYSICS FACULTY KIT Universität des Landes Baden-Württemberg und nationales Forschungszentrum in der Helmholtz-Gemeinschaft
More informationICS 252 Introduction to Computer Design
ICS 252 fall 2006 Eli Bozorgzadeh Computer Science Department-UCI References and Copyright Textbooks referred [Mic94] G. De Micheli Synthesis and Optimization of Digital Circuits McGraw-Hill, 1994. [CLR90]
More informationT U M. Performing Permuting on a Multiprocessor Architecture Using Packet Communication I N S T I T U T F Ü R I N F O R M A T I K
T U M I S T I T U T F Ü R I F O R M A T I K Performing Permuting on a Multiprocessor Architecture Using Packet Communication Riko Jacob Michael Schnupp ABCDE FGHIJ KLMO TUM-I0817 Juni 08 T E C H I S C
More informationModule 5: CPU Scheduling
Module 5: CPU Scheduling Basic Concepts Scheduling Criteria Scheduling Algorithms Multiple-Processor Scheduling Real-Time Scheduling Algorithm Evaluation 5.1 Basic Concepts Maximum CPU utilization obtained
More informationAnalysis of Algorithms [Reading: CLRS 2.2, 3] Laura Toma, csci2200, Bowdoin College
Analysis of Algorithms [Reading: CLRS 2.2, 3] Laura Toma, csci2200, Bowdoin College Why analysis? We want to predict how the algorithm will behave (e.g. running time) on arbitrary inputs, and how it will
More informationAn Optimized Interestingness Hotspot Discovery Framework for Large Gridded Spatio-temporal Datasets
IEEE Big Data 2015 Big Data in Geosciences Workshop An Optimized Interestingness Hotspot Discovery Framework for Large Gridded Spatio-temporal Datasets Fatih Akdag and Christoph F. Eick Department of Computer
More informationBalanced Truncation Model Reduction of Large and Sparse Generalized Linear Systems
Balanced Truncation Model Reduction of Large and Sparse Generalized Linear Systems Jos M. Badía 1, Peter Benner 2, Rafael Mayo 1, Enrique S. Quintana-Ortí 1, Gregorio Quintana-Ortí 1, A. Remón 1 1 Depto.
More informationChapter 6: CPU Scheduling
Chapter 6: CPU Scheduling Basic Concepts Scheduling Criteria Scheduling Algorithms Multiple-Processor Scheduling Real-Time Scheduling Algorithm Evaluation 6.1 Basic Concepts Maximum CPU utilization obtained
More informationMultitasking Polynomial Homotopy Continuation in PHCpack. Jan Verschelde
Multitasking Polynomial Homotopy Continuation in PHCpack Jan Verschelde University of Illinois at Chicago Department of Mathematics, Statistics, and Computer Science http://www.math.uic.edu/ jan jan@math.uic.edu
More informationDiscrete Optimization 2010 Lecture 3 Maximum Flows
Remainder: Shortest Paths Maximum Flows Discrete Optimization 2010 Lecture 3 Maximum Flows Marc Uetz University of Twente m.uetz@utwente.nl Lecture 3: sheet 1 / 29 Marc Uetz Discrete Optimization Outline
More informationMPI at MPI. Jens Saak. Max Planck Institute for Dynamics of Complex Technical Systems Computational Methods in Systems and Control Theory
MAX PLANCK INSTITUTE November 5, 2010 MPI at MPI Jens Saak Max Planck Institute for Dynamics of Complex Technical Systems Computational Methods in Systems and Control Theory FOR DYNAMICS OF COMPLEX TECHNICAL
More informationStatic-scheduling and hybrid-programming in SuperLU DIST on multicore cluster systems
Static-scheduling and hybrid-programming in SuperLU DIST on multicore cluster systems Ichitaro Yamazaki University of Tennessee, Knoxville Xiaoye Sherry Li Lawrence Berkeley National Laboratory MS49: Sparse
More informationNovel Heuristics for Coalition Structure Generation in Multi-agent Systems
, June 3 - July 2, 21, London, U.K. Novel Heuristics for Coalition Structure Generation in Multi-agent Systems Mustansar Ali Ghazanfar and Adam Prugel-Bennett Abstract A coalition is a set of self-interested
More informationJacobi-Based Eigenvalue Solver on GPU. Lung-Sheng Chien, NVIDIA
Jacobi-Based Eigenvalue Solver on GPU Lung-Sheng Chien, NVIDIA lchien@nvidia.com Outline Symmetric eigenvalue solver Experiment Applications Conclusions Symmetric eigenvalue solver The standard form is
More informationScalable Software for Multivariate Integration on Hybrid Platforms
Journal of Physics: Conference Series PAPER OPEN ACCESS Scalable Software for Multivariate Integration on Hybrid Platforms To cite this article: E de Doncker et al 2015 J. Phys.: Conf. Ser. 640 012062
More informationFINE-GRAINED PARALLEL INCOMPLETE LU FACTORIZATION
FINE-GRAINED PARALLEL INCOMPLETE LU FACTORIZATION EDMOND CHOW AND AFTAB PATEL Abstract. This paper presents a new fine-grained parallel algorithm for computing an incomplete LU factorization. All nonzeros
More informationMeasurement of Jet Energy Scale and Resolution at ATLAS and CMS at s = 8 TeV
Measurement of Jet Energy Scale and Resolution at ATLAS and CMS at s = 8 TeV EDSBlois 2015 02.07.2015 Dominik Haitz on behalf of the ATLAS and CMS Collaborations INSTITUT FÜR EXPERIMENTELLE KERNPHYSIK
More informationFINE-GRAINED PARALLEL INCOMPLETE LU FACTORIZATION
FINE-GRAINED PARALLEL INCOMPLETE LU FACTORIZATION EDMOND CHOW AND AFTAB PATEL Abstract. This paper presents a new fine-grained parallel algorithm for computing an incomplete LU factorization. All nonzeros
More informationGPU acceleration of Newton s method for large systems of polynomial equations in double double and quad double arithmetic
GPU acceleration of Newton s method for large systems of polynomial equations in double double and quad double arithmetic Jan Verschelde joint work with Xiangcheng Yu University of Illinois at Chicago
More informationA Better Scoring Model for De Novo Peptide Sequencing: The Symmetric Difference between Explained and Measured Masses Supplementary Figures
A Better Scoring Model for De Novo Peptide Sequencing: The Symmetric Difference between Explained and Measured Masses Supplementary Figures Thomas Tschager *, Simon Rösch *, Ludovic Gillet 2 and Peter
More informationAtmospheric Water Vapor Effect on GNSS Signals and InSAR Data
Atmospheric Water Vapor Effect on GNSS Signals and InSAR Data Basic Concept and Preliminary Results F. Alshawaf 1, S. Hinz 1, A. Thiele 1 T. Fuhrmann 2, B. Heck 2, A. Knöpfler 2, X. Luo 2, M. Mayer 2,
More informationAdvanced Data Structures
Simon Gog gog@kit.edu - 0 Simon Gog: KIT University of the State of Baden-Wuerttemberg and National Research Center of the Helmholtz Association www.kit.edu Dynamic Perfect Hashing What we want: O(1) lookup
More informationA parallel exponential integrator for large-scale discretizations of advection-diffusion models
A parallel exponential integrator for large-scale discretizations of advection-diffusion models L. Bergamaschi 1, M. Caliari 2, A. Martínez 3, and M. Vianello 3 1 Department of Mathematical Methods and
More informationSome Geometric and Algebraic Aspects of Domain Decomposition Methods
Some Geometric and Algebraic Aspects of Domain Decomposition Methods D.S.Butyugin 1, Y.L.Gurieva 1, V.P.Ilin 1,2, and D.V.Perevozkin 1 Abstract Some geometric and algebraic aspects of various domain decomposition
More informationThe Number of Spanning Trees in a Graph
Course Trees The Ubiquitous Structure in Computer Science and Mathematics, JASS 08 The Number of Spanning Trees in a Graph Konstantin Pieper Fakultät für Mathematik TU München April 28, 2008 Konstantin
More informationarxiv: v1 [math.co] 3 Feb 2014
Enumeration of nonisomorphic Hamiltonian cycles on square grid graphs arxiv:1402.0545v1 [math.co] 3 Feb 2014 Abstract Ed Wynn 175 Edmund Road, Sheffield S2 4EG, U.K. The enumeration of Hamiltonian cycles
More informationOzone_CCI Contribution
Ozone_CCI Contribution Peter Braesicke (CRG) on behalf of the Ozone_CCI Team Science Lead: Michel Van Roozendael, BIRA Fügen Sie auf der Masterfolie ein frei wählbares Bild ein (z.b. passend zum Vortrag)
More informationc 2015 Society for Industrial and Applied Mathematics
SIAM J. SCI. COMPUT. Vol. 37, No. 2, pp. C169 C193 c 2015 Society for Industrial and Applied Mathematics FINE-GRAINED PARALLEL INCOMPLETE LU FACTORIZATION EDMOND CHOW AND AFTAB PATEL Abstract. This paper
More informationCSL model checking of biochemical networks with Interval Decision Diagrams
CSL model checking of biochemical networks with Interval Decision Diagrams Brandenburg University of Technology Cottbus Computer Science Department http://www-dssz.informatik.tu-cottbus.de/software/mc.html
More informationDense Arithmetic over Finite Fields with CUMODP
Dense Arithmetic over Finite Fields with CUMODP Sardar Anisul Haque 1 Xin Li 2 Farnam Mansouri 1 Marc Moreno Maza 1 Wei Pan 3 Ning Xie 1 1 University of Western Ontario, Canada 2 Universidad Carlos III,
More informationTrust-Region SQP Methods with Inexact Linear System Solves for Large-Scale Optimization
Trust-Region SQP Methods with Inexact Linear System Solves for Large-Scale Optimization Denis Ridzal Department of Computational and Applied Mathematics Rice University, Houston, Texas dridzal@caam.rice.edu
More informationOn the Performance of SC-MMSE-FD Equalization for Fixed-Point Implementations
On the Performance of SC-MMSE-FD Equalization for Fixed-Point Implementations ISTC 2014, Bremen, Tamal Bose and Friedrich K. Jondral KIT Universität des Landes Baden-Württemberg und nationales Forschungszentrum
More information