Imaging using GPU. V-K Veligatla, Kapteyn Institute P. Labropoulos, ASTRON and Kapteyn Institute L. Koopmans, Kapteyn Institute

Size: px
Start display at page:

Download "Imaging using GPU. V-K Veligatla, Kapteyn Institute P. Labropoulos, ASTRON and Kapteyn Institute L. Koopmans, Kapteyn Institute"

Transcription

1 Imaging using GPU V-K Veligatla, Kapteyn Institute P. Labropoulos, ASTRON and Kapteyn Institute L. Koopmans, Kapteyn Institute

2 Introduction What is a GPU? Why another Imager? Large amount of data to be processed. W-projection is a requirement. Issues to Consider Wide field imaging Full Stokes Speed Experiment with New approaches

3 Basic Imaging Equation V u, v, w = I l, m B l, m 1 l m i ul vm w 1 l m 1 e 2 2 For low frequency wide-field imaging the assumption w~0 does not hold If w=0, then visibilities = FFT(image) But this is not always the case hence WProjection. (S. Bhatnagar and T.J.Cornwell) dl dm

4 W-Projection V u, v, w = I l, m B l, m 1 l m 2 2 e 2 i ul vm w 1 l m dl dm Assuming W is constant for a given plane of UV values V u, v, w = e 2 i [w 1 l m 1 ] 2 2 I l, m B l,m 1 l m 2 Correction CorrectionTerm Term 2 e 2 i ul vm dl dm

5 Block Diagram Imaging+ correcting W planes Corrected Image

6 Wproj Convolution function W=0 W=64 W=127

7 Results (Simulation) Normal Imaging W-Projected

8 Results (Simulation) Normal Imaging W-Projected

9 3C196 W-Corrected

10 Full Stokes (Sim) II QQ U U V V

11 Full Stokes 3C196 I Q U V

12 LOFAR Eor GPU Cluster Configuration 2 x Tesla M1060 (GPU) Computing capability Processing 602 MHz 4GB Memory 16 x Intel(R) Xeon(R) CPU E GHz 12 GB Ram 80 nodes

13 W-Projection Performance Planes 128 Planes 256 Planes 200 T i me I n S ec ond s GpuImager CasaImager

14 Without W-Projection T i me I n S ec ond s GpuImager CasaImager

15 3C196 8 sq. degrees 256 Planes 80 Nodes 244 sub-bands In About 5 mins

16 3C196 (zoomed) 0.5 mjy rms noise at the edge of the image

17 Future Plans We are working on implementation of MVDR on the gpu. Experimenting with Weighting and Convolution schemes in the UV plane. Incorporating a very fast calibration scheme into the imager so both are done together

18 Thank You

19 Introduction What is a GPU? Why another Imager? Large amount of data to be processed. W-projection is a requirement. Issues to Consider Wide field imaging Full Stokes Speed Experiment with New approaches

20 Why use GPU Hundreds of Processors Very well suited for matrix operations. Well suited for parallel tasks. With the latest APIs (CUDA, OpenCL) easy to program.

21 Basic Imaging Equation V u, v, w = I l, m B l, m 1 l m i ul vm w 1 l m 1 e In general W~0 (U,V,W=0). This implies visibilities = FFT(image) 2 But this is not always the case hence WProjection. (T.J.Cornwell et all) 2 dl dm

22 W-Projection V u, v, w = I l, m B l, m 1 l m i ul vm w 1 l m 1 2 e 2 dl dm Assuming W is constant for a given plane of UV values 2 i[w 1 l 2 m 2 1 ] V u, v, w = e I l, m B l, m 1 l m i ul vm e G l, m, w V u, v, w =G l, m, w V u, v, w=0 IFFT V u, v, w =G l, m, w IFFT V u, v, w=0 Ref:Tim Cornwell and Liu dl dm

23 Block Diagram Imaging+ correcting W planes Corrected Image

24 W Correction W=0 W=64 W=127

25 Results (Simulation) Normal Imaging W-Projected

26 Results (Simulation) Normal Imaging W-Projected

27 3C196 W-Corrected

28 Full Stokes (Sim) II QQ U U V V

29 Full Stokes 3C196 I Q U V

30 LOFAR Eor GPU Cluster Configuration 2 x Tesla M1060 (GPU) Compute 1.3 capable MHz 4GB Memory 16 x Intel(R) Xeon(R) CPU 12 GB Ram We have 80 Such Nodes

31 W-Projection Performance Planes 128 Planes 256 Planes 200 T i me I n S ec ond s GpuImager CasaImager

32 Without W-Projection T i me I n S ec ond s GpuImager CasaImager

33 3C196 GPU Imager W-Projection 256 Planes 80 Nodes In About 5 mins

34 3C196 (zoomed)

35 LOFAR Eor GPU Cluster Configuration 2 x Tesla C1060 (GPU) Compute 1.3 capable MHz 4GB Memory 8 x Intel(R) Xeon(R) CPU 12 GB Ram We have 80 Such Nodes

36 Results (Simulation) Normal Imaging W-Projected

37 3C196 W-Corrected

38 W-Projection Performance Planes 128 Planes 256 Planes 200 T i me I n S ec ond s GpuImager CasaImager

39 3C196 GPU Imager W-Projection 256 Planes 80 Nodes In About 5 mins

40 Minimum Variance Distortionless Response Minimizes side lobe gains (cause beam to vary spatially) Computationally intensive Parallel computation possible. Dirty Image Equation: H I k l, m = A k l, m R k A k l, m I k l, m Pixel Intensity at l,m at time instance k A k l, m Antenna steering vector Rk Array correlation matrix (ACM).

41 MVDR The weights in dirty image A(l,m) are replaced with W(l,m) Weight set to minimize influence of interfering sources. W =argmin w W H R W Under the constraint W H A=1 H 1 A R H W = H 1 A R A H I k l, m =W R W Ref: R. Levanda and Amir Leshem, Liu at al., Stoica et al.

42 Challenges H 1 A R W = H 1 A R A H Inverse of Visibility Matrix ACM close to singular. Diagonal loading, (non-linear) dimensionality reduction L-curve type of analysis to select diagonal loading strength

43 Challenges: beamformer power vs time itle:/home/vkrishna/documents/experi/ Creator:MATLAB, The MathWorks, Inc. Vers CreationDate:07/07/ :17:26 anguagelevel:2 Flagged/missing data => Higher Diagonal Loading Factor Higher regularization factor => higher the noise in that epoch Cholesky Factorization to determine level of Regularization.

44 Noisy data

45 Results

46 CPU vs GPU CPU vs GPU 350 GPU CPU 300 Threads equivalent to #processors on system Each Thread processes part of final image Time in Minutes CPU implementation: GPU Implementation: Multiple threads process a single pixel Each thread calculates component of vector Size of Image 2048

47 Optimization Varying Block Size Varying Register Count Multiprocessor Warp Occupancy Multiprocessor Warp Occupancy Registers Per Thread Threads Per Block

48

49

50

51

52

53 Summary Since we produce snapshots we can also correct them for image plane effects on the fly Currently implemented a station beam library Highly scalable New methods of Imaging can be explored

54 Thank You

Array signal processing for radio-astronomy imaging and future radio telescopes

Array signal processing for radio-astronomy imaging and future radio telescopes Array signal processing for radio-astronomy imaging and future radio telescopes School of Engineering Bar-Ilan University Amir Leshem Faculty of EEMCS Delft University of Technology Joint wor with Alle-Jan

More information

Jacobi-Based Eigenvalue Solver on GPU. Lung-Sheng Chien, NVIDIA

Jacobi-Based Eigenvalue Solver on GPU. Lung-Sheng Chien, NVIDIA Jacobi-Based Eigenvalue Solver on GPU Lung-Sheng Chien, NVIDIA lchien@nvidia.com Outline Symmetric eigenvalue solver Experiment Applications Conclusions Symmetric eigenvalue solver The standard form is

More information

From LOFAR to SKA, challenges in distributed computing. Soobash Daiboo Paris Observatory -LESIA

From LOFAR to SKA, challenges in distributed computing. Soobash Daiboo Paris Observatory -LESIA From LOFAR to SKA, challenges in distributed computing Soobash Daiboo Paris Observatory -LESIA Overview LOFAR telescope Data processing with LOFAR NenuFar SKA Summary LOFAR science drivers Key science

More information

Multiphase Flow Simulations in Inclined Tubes with Lattice Boltzmann Method on GPU

Multiphase Flow Simulations in Inclined Tubes with Lattice Boltzmann Method on GPU Multiphase Flow Simulations in Inclined Tubes with Lattice Boltzmann Method on GPU Khramtsov D.P., Nekrasov D.A., Pokusaev B.G. Department of Thermodynamics, Thermal Engineering and Energy Saving Technologies,

More information

Confusion, the sky model and realistic simulations

Confusion, the sky model and realistic simulations Confusion, the sky model and realistic simulations Randall Wayth (CfA) with team Greenhill (Greenhill, Mitchell, Sault) team MAPS (Doeleman, Bhat) Ashes Update: England all out for 350. Australia wins

More information

The Probability Distribution of the MVDR Beamformer Outputs under Diagonal Loading. N. Raj Rao (Dept. of Electrical Engineering and Computer Science)

The Probability Distribution of the MVDR Beamformer Outputs under Diagonal Loading. N. Raj Rao (Dept. of Electrical Engineering and Computer Science) The Probability Distribution of the MVDR Beamformer Outputs under Diagonal Loading N. Raj Rao (Dept. of Electrical Engineering and Computer Science) & Alan Edelman (Dept. of Mathematics) Work supported

More information

Wide-Field Imaging: I

Wide-Field Imaging: I Wide-Field Imaging: I S. Bhatnagar NRAO, Socorro Twelfth Synthesis Imaging Workshop 010 June 8-15 Wide-field imaging What do we mean by wide-field imaging W-Term: D Fourier transform approximation breaks

More information

Statistical inversion of the LOFAR Epoch of Reionization experiment data model

Statistical inversion of the LOFAR Epoch of Reionization experiment data model Statistical inversion of the LOFAR Epoch of Reionization experiment data model ASTRON, Oude Hoogeveensedijk 4, 7991 PD, Dwingeloo, the Netherlands Kapteyn Astronomical Institute, Landleven 12, 9747 AD,

More information

A Quantum Chemistry Domain-Specific Language for Heterogeneous Clusters

A Quantum Chemistry Domain-Specific Language for Heterogeneous Clusters A Quantum Chemistry Domain-Specific Language for Heterogeneous Clusters ANTONINO TUMEO, ORESTE VILLA Collaborators: Karol Kowalski, Sriram Krishnamoorthy, Wenjing Ma, Simone Secchi May 15, 2012 1 Outline!

More information

LOFAR Polarization Commissioning by the MKSP

LOFAR Polarization Commissioning by the MKSP LOFAR Polarization Commissioning by the MKSP James M Anderson anderson@mpifr-bonn.mpg.de On behalf of the LOFAR collaboration Surveys KSP Meeting, Leiden, 2010 March 11 James M Anderson 1/15 Timeline 2010

More information

ALMA Development Program

ALMA Development Program ALMA Development Program Jeff Kern CASA Team Lead Atacama Large Millimeter/submillimeter Array Expanded Very Large Array Robert C. Byrd Green Bank Telescope Very Long Baseline Array Opportunities for Software

More information

Dynamic Scheduling for Work Agglomeration on Heterogeneous Clusters

Dynamic Scheduling for Work Agglomeration on Heterogeneous Clusters Dynamic Scheduling for Work Agglomeration on Heterogeneous Clusters Jonathan Lifflander, G. Carl Evans, Anshu Arya, Laxmikant Kale University of Illinois Urbana-Champaign May 25, 2012 Work is overdecomposed

More information

Beamspace Adaptive Beamforming and the GSC

Beamspace Adaptive Beamforming and the GSC April 27, 2011 Overview The MVDR beamformer: performance and behavior. Generalized Sidelobe Canceller reformulation. Implementation of the GSC. Beamspace interpretation of the GSC. Reduced complexity of

More information

Recent Progress of Parallel SAMCEF with MUMPS MUMPS User Group Meeting 2013

Recent Progress of Parallel SAMCEF with MUMPS MUMPS User Group Meeting 2013 Recent Progress of Parallel SAMCEF with User Group Meeting 213 Jean-Pierre Delsemme Product Development Manager Summary SAMCEF, a brief history Co-simulation, a good candidate for parallel processing MAAXIMUS,

More information

Computing least squares condition numbers on hybrid multicore/gpu systems

Computing least squares condition numbers on hybrid multicore/gpu systems Computing least squares condition numbers on hybrid multicore/gpu systems M. Baboulin and J. Dongarra and R. Lacroix Abstract This paper presents an efficient computation for least squares conditioning

More information

Multicore Parallelization of Determinant Quantum Monte Carlo Simulations

Multicore Parallelization of Determinant Quantum Monte Carlo Simulations Multicore Parallelization of Determinant Quantum Monte Carlo Simulations Andrés Tomás, Che-Rung Lee, Zhaojun Bai, Richard Scalettar UC Davis SIAM Conference on Computation Science & Engineering Reno, March

More information

Statistics of the Universe: Exa-calculations and Cosmology's Data Deluge

Statistics of the Universe: Exa-calculations and Cosmology's Data Deluge Statistics of the Universe: Exa-calculations and Cosmology's Data Deluge Debbie Bard Matt Bellis Cosmology: the study of the nature and history of the Universe History of Universe driven by competing forces:

More information

Low-frequency radio astronomy and wide-field imaging

Low-frequency radio astronomy and wide-field imaging Low-frequency radio astronomy and wide-field imaging James Miller-Jones (NRAO Charlottesville/Curtin University) ITN 215212: Black Hole Universe Many slides taken from NRAO Synthesis Imaging Workshop (Tracy

More information

Data analysis of massive data sets a Planck example

Data analysis of massive data sets a Planck example Data analysis of massive data sets a Planck example Radek Stompor (APC) LOFAR workshop, Meudon, 29/03/06 Outline 1. Planck mission; 2. Planck data set; 3. Planck data analysis plan and challenges; 4. Planck

More information

Robust Capon Beamforming

Robust Capon Beamforming Robust Capon Beamforming Yi Jiang Petre Stoica Zhisong Wang Jian Li University of Florida Uppsala University University of Florida University of Florida March 11, 2003 ASAP Workshop 2003 1 Outline Standard

More information

Multi-GPU Simulations of the Infinite Universe

Multi-GPU Simulations of the Infinite Universe () Multi-GPU of the Infinite with with G. Rácz, I. Szapudi & L. Dobos Physics of Complex Systems Department Eötvös Loránd University, Budapest June 22, 2018, Budapest, Hungary Outline 1 () 2 () Concordance

More information

Accelerating Linear Algebra on Heterogeneous Architectures of Multicore and GPUs using MAGMA and DPLASMA and StarPU Schedulers

Accelerating Linear Algebra on Heterogeneous Architectures of Multicore and GPUs using MAGMA and DPLASMA and StarPU Schedulers UT College of Engineering Tutorial Accelerating Linear Algebra on Heterogeneous Architectures of Multicore and GPUs using MAGMA and DPLASMA and StarPU Schedulers Stan Tomov 1, George Bosilca 1, and Cédric

More information

Parametric high resolution techniques for radio astronomical imaging

Parametric high resolution techniques for radio astronomical imaging Parametric high resolution techniques for radio astronomical imaging 1 arxiv:0807.4852v1 [astro-ph] 30 Jul 2008 Chen Ben-David and Amir Leshem (Senior Member) School of Engineering Bar-Ilan University

More information

CHAPTER 3 ROBUST ADAPTIVE BEAMFORMING

CHAPTER 3 ROBUST ADAPTIVE BEAMFORMING 50 CHAPTER 3 ROBUST ADAPTIVE BEAMFORMING 3.1 INTRODUCTION Adaptive beamforming is used for enhancing a desired signal while suppressing noise and interference at the output of an array of sensors. It is

More information

ACCELERATED LEARNING OF GAUSSIAN PROCESS MODELS

ACCELERATED LEARNING OF GAUSSIAN PROCESS MODELS ACCELERATED LEARNING OF GAUSSIAN PROCESS MODELS Bojan Musizza, Dejan Petelin, Juš Kocijan, Jožef Stefan Institute Jamova 39, Ljubljana, Slovenia University of Nova Gorica Vipavska 3, Nova Gorica, Slovenia

More information

S0214 : GPU Based Stacking Sequence Generation For Composite Skins Using GA

S0214 : GPU Based Stacking Sequence Generation For Composite Skins Using GA S0214 : GPU Based Stacking Sequence Generation For Composite Skins Using GA Date: 16th May 2012 Wed, 3pm to 3.25pm(Adv. Session) Sathyanarayana K., Manish Banga, and Ravi Kumar G. V. V. Engineering Services,

More information

Julian Merten. GPU Computing and Alternative Architecture

Julian Merten. GPU Computing and Alternative Architecture Future Directions of Cosmological Simulations / Edinburgh 1 / 16 Julian Merten GPU Computing and Alternative Architecture Institut für Theoretische Astrophysik Zentrum für Astronomie Universität Heidelberg

More information

A ROBUST BEAMFORMER BASED ON WEIGHTED SPARSE CONSTRAINT

A ROBUST BEAMFORMER BASED ON WEIGHTED SPARSE CONSTRAINT Progress In Electromagnetics Research Letters, Vol. 16, 53 60, 2010 A ROBUST BEAMFORMER BASED ON WEIGHTED SPARSE CONSTRAINT Y. P. Liu and Q. Wan School of Electronic Engineering University of Electronic

More information

arxiv: v1 [astro-ph.im] 16 Apr 2009

arxiv: v1 [astro-ph.im] 16 Apr 2009 Closed form solution of the maximum entropy equations with application to fast radio astronomical image formation arxiv:0904.2545v1 [astro-ph.im] 16 Apr 2009 Amir Leshem 1 School of Engineering, Bar-Ilan

More information

COMPARISON OF CPU AND GPU IMPLEMENTATIONS OF THE LATTICE BOLTZMANN METHOD

COMPARISON OF CPU AND GPU IMPLEMENTATIONS OF THE LATTICE BOLTZMANN METHOD XVIII International Conference on Water Resources CMWR 2010 J. Carrera (Ed) c CIMNE, Barcelona, 2010 COMPARISON OF CPU AND GPU IMPLEMENTATIONS OF THE LATTICE BOLTZMANN METHOD James.E. McClure, Jan F. Prins

More information

Future Improvements of Weather and Climate Prediction

Future Improvements of Weather and Climate Prediction Future Improvements of Weather and Climate Prediction Unidata Policy Committee October 21, 2010 Alexander E. MacDonald, Ph.D. Deputy Assistant Administrator for Labs and Cooperative Institutes & Director,

More information

Real-time signal detection for pulsars and radio transients using GPUs

Real-time signal detection for pulsars and radio transients using GPUs Real-time signal detection for pulsars and radio transients using GPUs W. Armour, M. Giles, A. Karastergiou and C. Williams. University of Oxford. 15 th July 2013 1 Background of GPUs Why use GPUs? Influence

More information

Radio Astronomical Image Formation using Constrained Least Squares and Krylov Subspaces

Radio Astronomical Image Formation using Constrained Least Squares and Krylov Subspaces Radio Astronomical Image Formation using Constrained Least Squares and Krylov Subspaces A. Mouri Sardarabadi, Amir Leshem 2, Alle-Jan van der Veen arxiv:55.348v [astro-ph.im] 6 May 25 Abstract Image formation

More information

ROBUST ADAPTIVE BEAMFORMING BASED ON CO- VARIANCE MATRIX RECONSTRUCTION FOR LOOK DIRECTION MISMATCH

ROBUST ADAPTIVE BEAMFORMING BASED ON CO- VARIANCE MATRIX RECONSTRUCTION FOR LOOK DIRECTION MISMATCH Progress In Electromagnetics Research Letters, Vol. 25, 37 46, 2011 ROBUST ADAPTIVE BEAMFORMING BASED ON CO- VARIANCE MATRIX RECONSTRUCTION FOR LOOK DIRECTION MISMATCH R. Mallipeddi 1, J. P. Lie 2, S.

More information

Quality Report Generated with Pix4Dmapper Pro version

Quality Report Generated with Pix4Dmapper Pro version Quality Report Generated with Pix4Dmapper Pro version 3.0.13 Important: Click on the different icons for: Help to analyze the results in the Quality Report Additional information about the sections Click

More information

German LOFAR a new era in radio astronomy

German LOFAR a new era in radio astronomy German LOFAR a new era in radio astronomy Marcus Brüggen International University Bremen Rainer Beck (MPIfR) Heino Falcke (ASTRON) Anton Zensus (MPIfR) Köln, 28. Sep 2005 Outline Why LOFAR? What is happening

More information

arxiv: v1 [hep-lat] 7 Oct 2010

arxiv: v1 [hep-lat] 7 Oct 2010 arxiv:.486v [hep-lat] 7 Oct 2 Nuno Cardoso CFTP, Instituto Superior Técnico E-mail: nunocardoso@cftp.ist.utl.pt Pedro Bicudo CFTP, Instituto Superior Técnico E-mail: bicudo@ist.utl.pt We discuss the CUDA

More information

PSEUDORANDOM numbers are very important in practice

PSEUDORANDOM numbers are very important in practice Proceedings of the Federated Conference on Computer Science and Information Systems pp 571 578 ISBN 978-83-681-51-4 Parallel GPU-accelerated Recursion-based Generators of Pseudorandom Numbers Przemysław

More information

Direct Self-Consistent Field Computations on GPU Clusters

Direct Self-Consistent Field Computations on GPU Clusters Direct Self-Consistent Field Computations on GPU Clusters Guochun Shi, Volodymyr Kindratenko National Center for Supercomputing Applications University of Illinois at UrbanaChampaign Ivan Ufimtsev, Todd

More information

Large Field of View Radio Astronomy; relevant for many KSP s

Large Field of View Radio Astronomy; relevant for many KSP s Large Field of View Radio Astronomy; relevant for many KSP s SKA Key Science Drivers ORIGINS Probing the Dark Ages When & how were the first stars formed? Cosmology and Galaxy Evolution Galaxies, Dark

More information

Parametric High Resolution Techniques for Radio Astronomical Imaging Chen Ben-David and Amir Leshem, Senior Member, IEEE

Parametric High Resolution Techniques for Radio Astronomical Imaging Chen Ben-David and Amir Leshem, Senior Member, IEEE 670 IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, VOL. 2, NO. 5, OCTOBER 2008 Parametric High Resolution Techniques for Radio Astronomical Imaging Chen Ben-David and Amir Leshem, Senior Member,

More information

ADAPTIVE ANTENNAS. SPATIAL BF

ADAPTIVE ANTENNAS. SPATIAL BF ADAPTIVE ANTENNAS SPATIAL BF 1 1-Spatial reference BF -Spatial reference beamforming may not use of embedded training sequences. Instead, the directions of arrival (DoA) of the impinging waves are used

More information

A CUDA Solver for Helmholtz Equation

A CUDA Solver for Helmholtz Equation Journal of Computational Information Systems 11: 24 (2015) 7805 7812 Available at http://www.jofcis.com A CUDA Solver for Helmholtz Equation Mingming REN 1,2,, Xiaoguang LIU 1,2, Gang WANG 1,2 1 College

More information

Computation of Large Sparse Aggregated Areas for Analytic Database Queries

Computation of Large Sparse Aggregated Areas for Analytic Database Queries Computation of Large Sparse Aggregated Areas for Analytic Database Queries Steffen Wittmer Tobias Lauer Jedox AG Collaborators: Zurab Khadikov Alexander Haberstroh Peter Strohm Business Intelligence and

More information

Multivariate density estimation and its applications

Multivariate density estimation and its applications Multivariate density estimation and its applications Wing Hung Wong Stanford University June 2014, Madison, Wisconsin. Conference in honor of the 80 th birthday of Professor Grace Wahba Lessons I learned

More information

SP-CNN: A Scalable and Programmable CNN-based Accelerator. Dilan Manatunga Dr. Hyesoon Kim Dr. Saibal Mukhopadhyay

SP-CNN: A Scalable and Programmable CNN-based Accelerator. Dilan Manatunga Dr. Hyesoon Kim Dr. Saibal Mukhopadhyay SP-CNN: A Scalable and Programmable CNN-based Accelerator Dilan Manatunga Dr. Hyesoon Kim Dr. Saibal Mukhopadhyay Motivation Power is a first-order design constraint, especially for embedded devices. Certain

More information

Behavioral Simulations in MapReduce

Behavioral Simulations in MapReduce Behavioral Simulations in MapReduce Guozhang Wang, Marcos Vaz Salles, Benjamin Sowell, Xun Wang, Tuan Cao, Alan Demers, Johannes Gehrke, Walker White Cornell University 1 What are Behavioral Simulations?

More information

Cosmology with Galaxy Clusters: Observations meet High-Performance-Computing

Cosmology with Galaxy Clusters: Observations meet High-Performance-Computing Cosmology with Galaxy Clusters: Observations meet High-Performance-Computing Julian Merten (ITA/ZAH) Clusters of galaxies GPU lensing codes Abell 2744 CLASH: A HST/MCT programme Clusters of galaxies DM

More information

Determining the Specification of an Aperture Array for Cosmological Surveys

Determining the Specification of an Aperture Array for Cosmological Surveys Determining the Specification of an Aperture Array for Cosmological Surveys P. Alexander University of Cambridge, Department of Physics, Cavendish Laboratory, Cambridge, UK. A. J. aulkner Jodrell Bank

More information

Multi-frequency imaging of Cygnus A with LOFAR

Multi-frequency imaging of Cygnus A with LOFAR Netherlands Institute for Radio Astronomy Multi-frequency imaging of Cygnus A with LOFAR John McKean (ASTRON) and all the imaging busy week team! ASTRON is part of the Netherlands Organisation for Scientific

More information

Machine Learning for Gravitational Wave signals classification in LIGO and Virgo

Machine Learning for Gravitational Wave signals classification in LIGO and Virgo Machine Learning for Gravitational Wave signals classification in LIGO and Virgo Elena Cuoco European Gravitational Observatory www.elenacuoco.com @elenacuoco 2 About me About me Working as Data Analyst

More information

Applications of Robust Optimization in Signal Processing: Beamforming and Power Control Fall 2012

Applications of Robust Optimization in Signal Processing: Beamforming and Power Control Fall 2012 Applications of Robust Optimization in Signal Processing: Beamforg and Power Control Fall 2012 Instructor: Farid Alizadeh Scribe: Shunqiao Sun 12/09/2012 1 Overview In this presentation, we study the applications

More information

Scalable and Power-Efficient Data Mining Kernels

Scalable and Power-Efficient Data Mining Kernels Scalable and Power-Efficient Data Mining Kernels Alok Choudhary, John G. Searle Professor Dept. of Electrical Engineering and Computer Science and Professor, Kellogg School of Management Director of the

More information

TR A Comparison of the Performance of SaP::GPU and Intel s Math Kernel Library (MKL) for Solving Dense Banded Linear Systems

TR A Comparison of the Performance of SaP::GPU and Intel s Math Kernel Library (MKL) for Solving Dense Banded Linear Systems TR-0-07 A Comparison of the Performance of ::GPU and Intel s Math Kernel Library (MKL) for Solving Dense Banded Linear Systems Ang Li, Omkar Deshmukh, Radu Serban, Dan Negrut May, 0 Abstract ::GPU is a

More information

Implementing NNLO into MCFM

Implementing NNLO into MCFM Implementing NNLO into MCFM Downloadable from mcfm.fnal.gov A Multi-Threaded Version of MCFM, J.M. Campbell, R.K. Ellis, W. Giele, 2015 Higgs boson production in association with a jet at NNLO using jettiness

More information

Square Kilometre Array Science Data Challenge 1

Square Kilometre Array Science Data Challenge 1 Square Kilometre Array Science Data Challenge 1 arxiv:1811.10454v1 [astro-ph.im] 26 Nov 2018 Anna Bonaldi & Robert Braun, for the SKAO Science Team SKA Organization, Jodrell Bank, Lower Withington, Macclesfield,

More information

Welcome to MCS 572. content and organization expectations of the course. definition and classification

Welcome to MCS 572. content and organization expectations of the course. definition and classification Welcome to MCS 572 1 About the Course content and organization expectations of the course 2 Supercomputing definition and classification 3 Measuring Performance speedup and efficiency Amdahl s Law Gustafson

More information

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Jorge González-Domínguez*, Bertil Schmidt*, Jan C. Kässens**, Lars Wienbrandt** *Parallel and Distributed Architectures

More information

The Square Kilometre Array & High speed data recording

The Square Kilometre Array & High speed data recording The Square Kilometre Array & High speed Astrophysics Group, Cavendish Laboratory, University of Cambridge http://www.mrao.cam.ac.uk/ bn204/ CRISP Annual Meeting FORTH, Crete, Greece March 2012 Outline

More information

Initial deep LOFAR observations of Epoch of Reionization windows: I. The North Celestial Pole

Initial deep LOFAR observations of Epoch of Reionization windows: I. The North Celestial Pole Astronomy& Astrophysics manuscript no. ncp_eor c ESO 2013 January 11, 2013 Initial deep LOFAR observations of Epoch of Reionization windows: I. The North Celestial Pole S. Yatawatta 2, A. G. de Bruyn 1,

More information

Near Optimal Adaptive Robust Beamforming

Near Optimal Adaptive Robust Beamforming Near Optimal Adaptive Robust Beamforming The performance degradation in traditional adaptive beamformers can be attributed to the imprecise knowledge of the array steering vector and inaccurate estimation

More information

Porting a sphere optimization program from LAPACK to ScaLAPACK

Porting a sphere optimization program from LAPACK to ScaLAPACK Porting a sphere optimization program from LAPACK to ScaLAPACK Mathematical Sciences Institute, Australian National University. For presentation at Computational Techniques and Applications Conference

More information

Exascale Computing for Radio Astronomy: GPU or FPGA?

Exascale Computing for Radio Astronomy: GPU or FPGA? Exascale Computing for Radio Astronomy: GPU or FPGA? Kees van Berkel MPSoC 2016, Nara, Japan 2016, July 14 Radio Astronomy: Herculus A (a.k.a. 3C 348) optically invisible jets, one-and-a-half million light-years

More information

Shortest Lattice Vector Enumeration on Graphics Cards

Shortest Lattice Vector Enumeration on Graphics Cards Shortest Lattice Vector Enumeration on Graphics Cards Jens Hermans 1 Michael Schneider 2 Fréderik Vercauteren 1 Johannes Buchmann 2 Bart Preneel 1 1 K.U.Leuven 2 TU Darmstadt SHARCS - 10 September 2009

More information

Parallel Longest Common Subsequence using Graphics Hardware

Parallel Longest Common Subsequence using Graphics Hardware Parallel Longest Common Subsequence using Graphics Hardware John Kloetzli rian Strege Jonathan Decker Dr. Marc Olano Presented by: rian Strege 1 Overview Introduction Problem Statement ackground and Related

More information

A model leading to self-consistent iteration computation with need for HP LA (e.g, diagonalization and orthogonalization)

A model leading to self-consistent iteration computation with need for HP LA (e.g, diagonalization and orthogonalization) A model leading to self-consistent iteration computation with need for HP LA (e.g, diagonalization and orthogonalization) Schodinger equation: Hψ = Eψ Choose a basis set of wave functions Two cases: Orthonormal

More information

Dr. Andrea Bocci. Using GPUs to Accelerate Online Event Reconstruction. at the Large Hadron Collider. Applied Physicist

Dr. Andrea Bocci. Using GPUs to Accelerate Online Event Reconstruction. at the Large Hadron Collider. Applied Physicist Using GPUs to Accelerate Online Event Reconstruction at the Large Hadron Collider Dr. Andrea Bocci Applied Physicist On behalf of the CMS Collaboration Discover CERN Inside the Large Hadron Collider at

More information

Non-Closing Offsets on the VLA. R. C. Walker National Radio Astronomy Observatory Charlottesville VA.

Non-Closing Offsets on the VLA. R. C. Walker National Radio Astronomy Observatory Charlottesville VA. VLA SCIENTIFIC MEMORANDUM NO. 152 Non-Closing Offsets on the VLA R. C. Walker National Radio Astronomy Observatory Charlottesville VA. March 1984 Recent efforts to obtain very high dynamic range in VLA

More information

Sunrise: Patrik Jonsson. Panchromatic SED Models of Simulated Galaxies. Lecture 2: Working with Sunrise. Harvard-Smithsonian Center for Astrophysics

Sunrise: Patrik Jonsson. Panchromatic SED Models of Simulated Galaxies. Lecture 2: Working with Sunrise. Harvard-Smithsonian Center for Astrophysics Sunrise: Panchromatic SED Models of Simulated Galaxies Lecture 2: Working with Sunrise Patrik Jonsson Harvard-Smithsonian Center for Astrophysics Lecture outline Lecture 1: Why Sunrise? What does it do?

More information

Deconvolving Primary Beam Patterns from SKA Images

Deconvolving Primary Beam Patterns from SKA Images SKA memo 103, 14 aug 2008 Deconvolving Primary Beam Patterns from SKA Images Melvyn Wright & Stuartt Corder University of California, Berkeley, & Caltech, Pasadena, CA. ABSTRACT In this memo we present

More information

Adaptive beamforming for uniform linear arrays with unknown mutual coupling. IEEE Antennas and Wireless Propagation Letters.

Adaptive beamforming for uniform linear arrays with unknown mutual coupling. IEEE Antennas and Wireless Propagation Letters. Title Adaptive beamforming for uniform linear arrays with unknown mutual coupling Author(s) Liao, B; Chan, SC Citation IEEE Antennas And Wireless Propagation Letters, 2012, v. 11, p. 464-467 Issued Date

More information

The Lattice Boltzmann Simulation on Multi-GPU Systems

The Lattice Boltzmann Simulation on Multi-GPU Systems The Lattice Boltzmann Simulation on Multi-GPU Systems Thor Kristian Valderhaug Master of Science in Computer Science Submission date: June 2011 Supervisor: Anne Cathrine Elster, IDI Norwegian University

More information

Stochastic Modelling of Electron Transport on different HPC architectures

Stochastic Modelling of Electron Transport on different HPC architectures Stochastic Modelling of Electron Transport on different HPC architectures www.hp-see.eu E. Atanassov, T. Gurov, A. Karaivan ova Institute of Information and Communication Technologies Bulgarian Academy

More information

Very Large Array Sky Survey

Very Large Array Sky Survey 1 Very Large Array Sky Survey Technical Implementation & Test Plan Casey Law, UC Berkeley Galactic Center (Survey) Multiwavelength Image Credit: X-ray: NASA/UMass/D.Wang et al., Radio: N RAO/AUI/NSF/NRL/N.Kassim,

More information

Claude Tadonki. MINES ParisTech PSL Research University Centre de Recherche Informatique

Claude Tadonki. MINES ParisTech PSL Research University Centre de Recherche Informatique Claude Tadonki MINES ParisTech PSL Research University Centre de Recherche Informatique claude.tadonki@mines-paristech.fr Monthly CRI Seminar MINES ParisTech - CRI June 06, 2016, Fontainebleau (France)

More information

Heterogeneous programming for hybrid CPU-GPU systems: Lessons learned from computational chemistry

Heterogeneous programming for hybrid CPU-GPU systems: Lessons learned from computational chemistry Heterogeneous programming for hybrid CPU-GPU systems: Lessons learned from computational chemistry and Eugene DePrince Argonne National Laboratory (LCF and CNM) (Eugene moved to Georgia Tech last week)

More information

SPARSE SOLVERS POISSON EQUATION. Margreet Nool. November 9, 2015 FOR THE. CWI, Multiscale Dynamics

SPARSE SOLVERS POISSON EQUATION. Margreet Nool. November 9, 2015 FOR THE. CWI, Multiscale Dynamics SPARSE SOLVERS FOR THE POISSON EQUATION Margreet Nool CWI, Multiscale Dynamics November 9, 2015 OUTLINE OF THIS TALK 1 FISHPACK, LAPACK, PARDISO 2 SYSTEM OVERVIEW OF CARTESIUS 3 POISSON EQUATION 4 SOLVERS

More information

GPU acceleration of Newton s method for large systems of polynomial equations in double double and quad double arithmetic

GPU acceleration of Newton s method for large systems of polynomial equations in double double and quad double arithmetic GPU acceleration of Newton s method for large systems of polynomial equations in double double and quad double arithmetic Jan Verschelde joint work with Xiangcheng Yu University of Illinois at Chicago

More information

Study of Large-Scale Galactic Magnetic Fields at Low Frequencies. Jana Köhler - MPIfR -

Study of Large-Scale Galactic Magnetic Fields at Low Frequencies. Jana Köhler - MPIfR - Study of Large-Scale Galactic Magnetic Fields at Low Frequencies Jana Köhler - MPIfR - How to measure Magnetic Fields??? How to measure Galactic Magnetic Field? Linear Polarization of Starlight product

More information

Detection and Localization of Tones and Pulses using an Uncalibrated Array

Detection and Localization of Tones and Pulses using an Uncalibrated Array Detection and Localization of Tones and Pulses using an Uncalibrated Array Steven W. Ellingson January 24, 2002 Contents 1 Introduction 2 2 Traditional Method (BF) 2 3 Proposed Method Version 1 (FXE) 3

More information

On Portability, Performance and Scalability of a MPI OpenCL Lattice Boltzmann Code

On Portability, Performance and Scalability of a MPI OpenCL Lattice Boltzmann Code On Portability, Performance and Scalability of a MPI OpenCL Lattice Boltzmann Code E Calore, S F Schifano, R Tripiccione Enrico Calore INFN Ferrara, Italy 7 th Workshop on UnConventional High Performance

More information

A Blackbox Polynomial System Solver on Parallel Shared Memory Computers

A Blackbox Polynomial System Solver on Parallel Shared Memory Computers A Blackbox Polynomial System Solver on Parallel Shared Memory Computers Jan Verschelde University of Illinois at Chicago Department of Mathematics, Statistics, and Computer Science The 20th Workshop on

More information

Classification of Higgs Boson Tau-Tau decays using GPU accelerated Neural Networks

Classification of Higgs Boson Tau-Tau decays using GPU accelerated Neural Networks Classification of Higgs Boson Tau-Tau decays using GPU accelerated Neural Networks Mohit Shridhar Stanford University mohits@stanford.edu, mohit@u.nus.edu Abstract In particle physics, Higgs Boson to tau-tau

More information

Foregrounds for EoR 21cm signals: their properties, effects and strategies to deal with them

Foregrounds for EoR 21cm signals: their properties, effects and strategies to deal with them Foregrounds for EoR 21cm signals: their properties, effects and strategies to deal with them Ger de Bruyn Groningen Reionization meeting, 29 jun 05 ASTRON, Dwingeloo & Kapteyn Institute, Groningen Outline

More information

New Zealand Impacts on CSP and SDP Designs

New Zealand Impacts on CSP and SDP Designs New Zealand Impacts on CSP and SDP Designs Dr Andrew Ensor Director HPC Research Laboratory/ Director NZ SKA Alliance Mid-Year Mini SKA Colloquium 2018 12 July 2018 Past, Present, Future Mega-Science Projects

More information

GPU Acceleration of Cutoff Pair Potentials for Molecular Modeling Applications

GPU Acceleration of Cutoff Pair Potentials for Molecular Modeling Applications GPU Acceleration of Cutoff Pair Potentials for Molecular Modeling Applications Christopher Rodrigues, David J. Hardy, John E. Stone, Klaus Schulten, Wen-Mei W. Hwu University of Illinois at Urbana-Champaign

More information

The high angular resolution component of the SKA

The high angular resolution component of the SKA The high angular resolution component of the SKA Tingay, S. 1, Bignall, H. 1, Colegate, T. 1, Aben, G. 2, Nicolls, J. 2 & Weston, S. 3 Prof. Steven Tingay SKA2010 Deputy Director, ICRAR March 22 25, 2010

More information

Extracting a signal from the

Extracting a signal from the Extracting a signal from the epoch of reionization with LOFAR Geraint Harker Kapteyn Institute, Groningen, the Netherlands Collaborators: Ger de Bruyn; Michiel Brentjens, Leon Koopmans, Saleem Zaroubi;

More information

PDF hosted at the Radboud Repository of the Radboud University Nijmegen

PDF hosted at the Radboud Repository of the Radboud University Nijmegen PDF hosted at the Radboud Repository of the Radboud University Nijmegen The following full text is a preprint version which may differ from the publisher's version. For additional information about this

More information

Delayed and Higher-Order Transfer Entropy

Delayed and Higher-Order Transfer Entropy Delayed and Higher-Order Transfer Entropy Michael Hansen (April 23, 2011) Background Transfer entropy (TE) is an information-theoretic measure of directed information flow introduced by Thomas Schreiber

More information

Introduction to numerical computations on the GPU

Introduction to numerical computations on the GPU Introduction to numerical computations on the GPU Lucian Covaci http://lucian.covaci.org/cuda.pdf Tuesday 1 November 11 1 2 Outline: NVIDIA Tesla and Geforce video cards: architecture CUDA - C: programming

More information

S XMP LIBRARY INTERNALS. Niall Emmart University of Massachusetts. Follow on to S6151 XMP: An NVIDIA CUDA Accelerated Big Integer Library

S XMP LIBRARY INTERNALS. Niall Emmart University of Massachusetts. Follow on to S6151 XMP: An NVIDIA CUDA Accelerated Big Integer Library S6349 - XMP LIBRARY INTERNALS Niall Emmart University of Massachusetts Follow on to S6151 XMP: An NVIDIA CUDA Accelerated Big Integer Library High Performance Modular Exponentiation A^K mod P Where A,

More information

Algorithm for Sparse Approximate Inverse Preconditioners in the Conjugate Gradient Method

Algorithm for Sparse Approximate Inverse Preconditioners in the Conjugate Gradient Method Algorithm for Sparse Approximate Inverse Preconditioners in the Conjugate Gradient Method Ilya B. Labutin A.A. Trofimuk Institute of Petroleum Geology and Geophysics SB RAS, 3, acad. Koptyug Ave., Novosibirsk

More information

ECE 571 Advanced Microprocessor-Based Design Lecture 9

ECE 571 Advanced Microprocessor-Based Design Lecture 9 ECE 571 Advanced Microprocessor-Based Design Lecture 9 Vince Weaver http://web.eece.maine.edu/~vweaver vincent.weaver@maine.edu 20 February 2018 Announcements HW#4 was posted. About branch predictors Don

More information

Energy-efficient Mapping of Big Data Workflows under Deadline Constraints

Energy-efficient Mapping of Big Data Workflows under Deadline Constraints Energy-efficient Mapping of Big Data Workflows under Deadline Constraints Presenter: Tong Shu Authors: Tong Shu and Prof. Chase Q. Wu Big Data Center Department of Computer Science New Jersey Institute

More information

Imaging, Deconvolution & Image Analysis I. Theory. (IRAM/Obs. de Paris) 7 th IRAM Millimeter Interferometry School Oct. 4 - Oct.

Imaging, Deconvolution & Image Analysis I. Theory. (IRAM/Obs. de Paris) 7 th IRAM Millimeter Interferometry School Oct. 4 - Oct. Imaging, Deconvolution & Image Analysis I. Theory Jérôme PETY (IRAM/Obs. de Paris) 7 th IRAM Millimeter Interferometry School Oct. 4 - Oct. 8 2010, Grenoble Scientific Analysis of a mm Interferometer Output

More information

Monte Carlo Methods for Electron Transport: Scalability Study

Monte Carlo Methods for Electron Transport: Scalability Study Monte Carlo Methods for Electron Transport: Scalability Study www.hp-see.eu Aneta Karaivanova (Joint work with E. Atanassov and T. Gurov) Institute of Information and Communication Technologies Bulgarian

More information

Pulsars with LOFAR The Low-Frequency Array

Pulsars with LOFAR The Low-Frequency Array Pulsars with LOFAR The Low-Frequency Array Ben Stappers ASTRON, Dwingeloo With assistance from Jason Hessels,, Michael Kramer, Joeri van Leeuwen and Dan Stinebring. Next generation radio telescope Telescope

More information

Mayer Sampling Monte Carlo Calculation of Virial. Coefficients on Graphics Processors

Mayer Sampling Monte Carlo Calculation of Virial. Coefficients on Graphics Processors Mayer Sampling Monte Carlo Calculation of Virial Coefficients on Graphics Processors Andrew J. Schultz, Vipin Chaudhary, David A. Kofke,, and Nathaniel S. Barlow Department of Chemical and Biological Engineering,

More information

On the Use of a Many core Processor for Computational Fluid Dynamics Simulations

On the Use of a Many core Processor for Computational Fluid Dynamics Simulations On the Use of a Many core Processor for Computational Fluid Dynamics Simulations Sebastian Raase, Tomas Nordström Halmstad University, Sweden {sebastian.raase,tomas.nordstrom} @ hh.se Preface based on

More information