Imaging using GPU. V-K Veligatla, Kapteyn Institute P. Labropoulos, ASTRON and Kapteyn Institute L. Koopmans, Kapteyn Institute
|
|
- Matthew Mills
- 5 years ago
- Views:
Transcription
1 Imaging using GPU V-K Veligatla, Kapteyn Institute P. Labropoulos, ASTRON and Kapteyn Institute L. Koopmans, Kapteyn Institute
2 Introduction What is a GPU? Why another Imager? Large amount of data to be processed. W-projection is a requirement. Issues to Consider Wide field imaging Full Stokes Speed Experiment with New approaches
3 Basic Imaging Equation V u, v, w = I l, m B l, m 1 l m i ul vm w 1 l m 1 e 2 2 For low frequency wide-field imaging the assumption w~0 does not hold If w=0, then visibilities = FFT(image) But this is not always the case hence WProjection. (S. Bhatnagar and T.J.Cornwell) dl dm
4 W-Projection V u, v, w = I l, m B l, m 1 l m 2 2 e 2 i ul vm w 1 l m dl dm Assuming W is constant for a given plane of UV values V u, v, w = e 2 i [w 1 l m 1 ] 2 2 I l, m B l,m 1 l m 2 Correction CorrectionTerm Term 2 e 2 i ul vm dl dm
5 Block Diagram Imaging+ correcting W planes Corrected Image
6 Wproj Convolution function W=0 W=64 W=127
7 Results (Simulation) Normal Imaging W-Projected
8 Results (Simulation) Normal Imaging W-Projected
9 3C196 W-Corrected
10 Full Stokes (Sim) II QQ U U V V
11 Full Stokes 3C196 I Q U V
12 LOFAR Eor GPU Cluster Configuration 2 x Tesla M1060 (GPU) Computing capability Processing 602 MHz 4GB Memory 16 x Intel(R) Xeon(R) CPU E GHz 12 GB Ram 80 nodes
13 W-Projection Performance Planes 128 Planes 256 Planes 200 T i me I n S ec ond s GpuImager CasaImager
14 Without W-Projection T i me I n S ec ond s GpuImager CasaImager
15 3C196 8 sq. degrees 256 Planes 80 Nodes 244 sub-bands In About 5 mins
16 3C196 (zoomed) 0.5 mjy rms noise at the edge of the image
17 Future Plans We are working on implementation of MVDR on the gpu. Experimenting with Weighting and Convolution schemes in the UV plane. Incorporating a very fast calibration scheme into the imager so both are done together
18 Thank You
19 Introduction What is a GPU? Why another Imager? Large amount of data to be processed. W-projection is a requirement. Issues to Consider Wide field imaging Full Stokes Speed Experiment with New approaches
20 Why use GPU Hundreds of Processors Very well suited for matrix operations. Well suited for parallel tasks. With the latest APIs (CUDA, OpenCL) easy to program.
21 Basic Imaging Equation V u, v, w = I l, m B l, m 1 l m i ul vm w 1 l m 1 e In general W~0 (U,V,W=0). This implies visibilities = FFT(image) 2 But this is not always the case hence WProjection. (T.J.Cornwell et all) 2 dl dm
22 W-Projection V u, v, w = I l, m B l, m 1 l m i ul vm w 1 l m 1 2 e 2 dl dm Assuming W is constant for a given plane of UV values 2 i[w 1 l 2 m 2 1 ] V u, v, w = e I l, m B l, m 1 l m i ul vm e G l, m, w V u, v, w =G l, m, w V u, v, w=0 IFFT V u, v, w =G l, m, w IFFT V u, v, w=0 Ref:Tim Cornwell and Liu dl dm
23 Block Diagram Imaging+ correcting W planes Corrected Image
24 W Correction W=0 W=64 W=127
25 Results (Simulation) Normal Imaging W-Projected
26 Results (Simulation) Normal Imaging W-Projected
27 3C196 W-Corrected
28 Full Stokes (Sim) II QQ U U V V
29 Full Stokes 3C196 I Q U V
30 LOFAR Eor GPU Cluster Configuration 2 x Tesla M1060 (GPU) Compute 1.3 capable MHz 4GB Memory 16 x Intel(R) Xeon(R) CPU 12 GB Ram We have 80 Such Nodes
31 W-Projection Performance Planes 128 Planes 256 Planes 200 T i me I n S ec ond s GpuImager CasaImager
32 Without W-Projection T i me I n S ec ond s GpuImager CasaImager
33 3C196 GPU Imager W-Projection 256 Planes 80 Nodes In About 5 mins
34 3C196 (zoomed)
35 LOFAR Eor GPU Cluster Configuration 2 x Tesla C1060 (GPU) Compute 1.3 capable MHz 4GB Memory 8 x Intel(R) Xeon(R) CPU 12 GB Ram We have 80 Such Nodes
36 Results (Simulation) Normal Imaging W-Projected
37 3C196 W-Corrected
38 W-Projection Performance Planes 128 Planes 256 Planes 200 T i me I n S ec ond s GpuImager CasaImager
39 3C196 GPU Imager W-Projection 256 Planes 80 Nodes In About 5 mins
40 Minimum Variance Distortionless Response Minimizes side lobe gains (cause beam to vary spatially) Computationally intensive Parallel computation possible. Dirty Image Equation: H I k l, m = A k l, m R k A k l, m I k l, m Pixel Intensity at l,m at time instance k A k l, m Antenna steering vector Rk Array correlation matrix (ACM).
41 MVDR The weights in dirty image A(l,m) are replaced with W(l,m) Weight set to minimize influence of interfering sources. W =argmin w W H R W Under the constraint W H A=1 H 1 A R H W = H 1 A R A H I k l, m =W R W Ref: R. Levanda and Amir Leshem, Liu at al., Stoica et al.
42 Challenges H 1 A R W = H 1 A R A H Inverse of Visibility Matrix ACM close to singular. Diagonal loading, (non-linear) dimensionality reduction L-curve type of analysis to select diagonal loading strength
43 Challenges: beamformer power vs time itle:/home/vkrishna/documents/experi/ Creator:MATLAB, The MathWorks, Inc. Vers CreationDate:07/07/ :17:26 anguagelevel:2 Flagged/missing data => Higher Diagonal Loading Factor Higher regularization factor => higher the noise in that epoch Cholesky Factorization to determine level of Regularization.
44 Noisy data
45 Results
46 CPU vs GPU CPU vs GPU 350 GPU CPU 300 Threads equivalent to #processors on system Each Thread processes part of final image Time in Minutes CPU implementation: GPU Implementation: Multiple threads process a single pixel Each thread calculates component of vector Size of Image 2048
47 Optimization Varying Block Size Varying Register Count Multiprocessor Warp Occupancy Multiprocessor Warp Occupancy Registers Per Thread Threads Per Block
48
49
50
51
52
53 Summary Since we produce snapshots we can also correct them for image plane effects on the fly Currently implemented a station beam library Highly scalable New methods of Imaging can be explored
54 Thank You
Array signal processing for radio-astronomy imaging and future radio telescopes
Array signal processing for radio-astronomy imaging and future radio telescopes School of Engineering Bar-Ilan University Amir Leshem Faculty of EEMCS Delft University of Technology Joint wor with Alle-Jan
More informationJacobi-Based Eigenvalue Solver on GPU. Lung-Sheng Chien, NVIDIA
Jacobi-Based Eigenvalue Solver on GPU Lung-Sheng Chien, NVIDIA lchien@nvidia.com Outline Symmetric eigenvalue solver Experiment Applications Conclusions Symmetric eigenvalue solver The standard form is
More informationFrom LOFAR to SKA, challenges in distributed computing. Soobash Daiboo Paris Observatory -LESIA
From LOFAR to SKA, challenges in distributed computing Soobash Daiboo Paris Observatory -LESIA Overview LOFAR telescope Data processing with LOFAR NenuFar SKA Summary LOFAR science drivers Key science
More informationMultiphase Flow Simulations in Inclined Tubes with Lattice Boltzmann Method on GPU
Multiphase Flow Simulations in Inclined Tubes with Lattice Boltzmann Method on GPU Khramtsov D.P., Nekrasov D.A., Pokusaev B.G. Department of Thermodynamics, Thermal Engineering and Energy Saving Technologies,
More informationConfusion, the sky model and realistic simulations
Confusion, the sky model and realistic simulations Randall Wayth (CfA) with team Greenhill (Greenhill, Mitchell, Sault) team MAPS (Doeleman, Bhat) Ashes Update: England all out for 350. Australia wins
More informationThe Probability Distribution of the MVDR Beamformer Outputs under Diagonal Loading. N. Raj Rao (Dept. of Electrical Engineering and Computer Science)
The Probability Distribution of the MVDR Beamformer Outputs under Diagonal Loading N. Raj Rao (Dept. of Electrical Engineering and Computer Science) & Alan Edelman (Dept. of Mathematics) Work supported
More informationWide-Field Imaging: I
Wide-Field Imaging: I S. Bhatnagar NRAO, Socorro Twelfth Synthesis Imaging Workshop 010 June 8-15 Wide-field imaging What do we mean by wide-field imaging W-Term: D Fourier transform approximation breaks
More informationStatistical inversion of the LOFAR Epoch of Reionization experiment data model
Statistical inversion of the LOFAR Epoch of Reionization experiment data model ASTRON, Oude Hoogeveensedijk 4, 7991 PD, Dwingeloo, the Netherlands Kapteyn Astronomical Institute, Landleven 12, 9747 AD,
More informationA Quantum Chemistry Domain-Specific Language for Heterogeneous Clusters
A Quantum Chemistry Domain-Specific Language for Heterogeneous Clusters ANTONINO TUMEO, ORESTE VILLA Collaborators: Karol Kowalski, Sriram Krishnamoorthy, Wenjing Ma, Simone Secchi May 15, 2012 1 Outline!
More informationLOFAR Polarization Commissioning by the MKSP
LOFAR Polarization Commissioning by the MKSP James M Anderson anderson@mpifr-bonn.mpg.de On behalf of the LOFAR collaboration Surveys KSP Meeting, Leiden, 2010 March 11 James M Anderson 1/15 Timeline 2010
More informationALMA Development Program
ALMA Development Program Jeff Kern CASA Team Lead Atacama Large Millimeter/submillimeter Array Expanded Very Large Array Robert C. Byrd Green Bank Telescope Very Long Baseline Array Opportunities for Software
More informationDynamic Scheduling for Work Agglomeration on Heterogeneous Clusters
Dynamic Scheduling for Work Agglomeration on Heterogeneous Clusters Jonathan Lifflander, G. Carl Evans, Anshu Arya, Laxmikant Kale University of Illinois Urbana-Champaign May 25, 2012 Work is overdecomposed
More informationBeamspace Adaptive Beamforming and the GSC
April 27, 2011 Overview The MVDR beamformer: performance and behavior. Generalized Sidelobe Canceller reformulation. Implementation of the GSC. Beamspace interpretation of the GSC. Reduced complexity of
More informationRecent Progress of Parallel SAMCEF with MUMPS MUMPS User Group Meeting 2013
Recent Progress of Parallel SAMCEF with User Group Meeting 213 Jean-Pierre Delsemme Product Development Manager Summary SAMCEF, a brief history Co-simulation, a good candidate for parallel processing MAAXIMUS,
More informationComputing least squares condition numbers on hybrid multicore/gpu systems
Computing least squares condition numbers on hybrid multicore/gpu systems M. Baboulin and J. Dongarra and R. Lacroix Abstract This paper presents an efficient computation for least squares conditioning
More informationMulticore Parallelization of Determinant Quantum Monte Carlo Simulations
Multicore Parallelization of Determinant Quantum Monte Carlo Simulations Andrés Tomás, Che-Rung Lee, Zhaojun Bai, Richard Scalettar UC Davis SIAM Conference on Computation Science & Engineering Reno, March
More informationStatistics of the Universe: Exa-calculations and Cosmology's Data Deluge
Statistics of the Universe: Exa-calculations and Cosmology's Data Deluge Debbie Bard Matt Bellis Cosmology: the study of the nature and history of the Universe History of Universe driven by competing forces:
More informationLow-frequency radio astronomy and wide-field imaging
Low-frequency radio astronomy and wide-field imaging James Miller-Jones (NRAO Charlottesville/Curtin University) ITN 215212: Black Hole Universe Many slides taken from NRAO Synthesis Imaging Workshop (Tracy
More informationData analysis of massive data sets a Planck example
Data analysis of massive data sets a Planck example Radek Stompor (APC) LOFAR workshop, Meudon, 29/03/06 Outline 1. Planck mission; 2. Planck data set; 3. Planck data analysis plan and challenges; 4. Planck
More informationRobust Capon Beamforming
Robust Capon Beamforming Yi Jiang Petre Stoica Zhisong Wang Jian Li University of Florida Uppsala University University of Florida University of Florida March 11, 2003 ASAP Workshop 2003 1 Outline Standard
More informationMulti-GPU Simulations of the Infinite Universe
() Multi-GPU of the Infinite with with G. Rácz, I. Szapudi & L. Dobos Physics of Complex Systems Department Eötvös Loránd University, Budapest June 22, 2018, Budapest, Hungary Outline 1 () 2 () Concordance
More informationAccelerating Linear Algebra on Heterogeneous Architectures of Multicore and GPUs using MAGMA and DPLASMA and StarPU Schedulers
UT College of Engineering Tutorial Accelerating Linear Algebra on Heterogeneous Architectures of Multicore and GPUs using MAGMA and DPLASMA and StarPU Schedulers Stan Tomov 1, George Bosilca 1, and Cédric
More informationParametric high resolution techniques for radio astronomical imaging
Parametric high resolution techniques for radio astronomical imaging 1 arxiv:0807.4852v1 [astro-ph] 30 Jul 2008 Chen Ben-David and Amir Leshem (Senior Member) School of Engineering Bar-Ilan University
More informationCHAPTER 3 ROBUST ADAPTIVE BEAMFORMING
50 CHAPTER 3 ROBUST ADAPTIVE BEAMFORMING 3.1 INTRODUCTION Adaptive beamforming is used for enhancing a desired signal while suppressing noise and interference at the output of an array of sensors. It is
More informationACCELERATED LEARNING OF GAUSSIAN PROCESS MODELS
ACCELERATED LEARNING OF GAUSSIAN PROCESS MODELS Bojan Musizza, Dejan Petelin, Juš Kocijan, Jožef Stefan Institute Jamova 39, Ljubljana, Slovenia University of Nova Gorica Vipavska 3, Nova Gorica, Slovenia
More informationS0214 : GPU Based Stacking Sequence Generation For Composite Skins Using GA
S0214 : GPU Based Stacking Sequence Generation For Composite Skins Using GA Date: 16th May 2012 Wed, 3pm to 3.25pm(Adv. Session) Sathyanarayana K., Manish Banga, and Ravi Kumar G. V. V. Engineering Services,
More informationJulian Merten. GPU Computing and Alternative Architecture
Future Directions of Cosmological Simulations / Edinburgh 1 / 16 Julian Merten GPU Computing and Alternative Architecture Institut für Theoretische Astrophysik Zentrum für Astronomie Universität Heidelberg
More informationA ROBUST BEAMFORMER BASED ON WEIGHTED SPARSE CONSTRAINT
Progress In Electromagnetics Research Letters, Vol. 16, 53 60, 2010 A ROBUST BEAMFORMER BASED ON WEIGHTED SPARSE CONSTRAINT Y. P. Liu and Q. Wan School of Electronic Engineering University of Electronic
More informationarxiv: v1 [astro-ph.im] 16 Apr 2009
Closed form solution of the maximum entropy equations with application to fast radio astronomical image formation arxiv:0904.2545v1 [astro-ph.im] 16 Apr 2009 Amir Leshem 1 School of Engineering, Bar-Ilan
More informationCOMPARISON OF CPU AND GPU IMPLEMENTATIONS OF THE LATTICE BOLTZMANN METHOD
XVIII International Conference on Water Resources CMWR 2010 J. Carrera (Ed) c CIMNE, Barcelona, 2010 COMPARISON OF CPU AND GPU IMPLEMENTATIONS OF THE LATTICE BOLTZMANN METHOD James.E. McClure, Jan F. Prins
More informationFuture Improvements of Weather and Climate Prediction
Future Improvements of Weather and Climate Prediction Unidata Policy Committee October 21, 2010 Alexander E. MacDonald, Ph.D. Deputy Assistant Administrator for Labs and Cooperative Institutes & Director,
More informationReal-time signal detection for pulsars and radio transients using GPUs
Real-time signal detection for pulsars and radio transients using GPUs W. Armour, M. Giles, A. Karastergiou and C. Williams. University of Oxford. 15 th July 2013 1 Background of GPUs Why use GPUs? Influence
More informationRadio Astronomical Image Formation using Constrained Least Squares and Krylov Subspaces
Radio Astronomical Image Formation using Constrained Least Squares and Krylov Subspaces A. Mouri Sardarabadi, Amir Leshem 2, Alle-Jan van der Veen arxiv:55.348v [astro-ph.im] 6 May 25 Abstract Image formation
More informationROBUST ADAPTIVE BEAMFORMING BASED ON CO- VARIANCE MATRIX RECONSTRUCTION FOR LOOK DIRECTION MISMATCH
Progress In Electromagnetics Research Letters, Vol. 25, 37 46, 2011 ROBUST ADAPTIVE BEAMFORMING BASED ON CO- VARIANCE MATRIX RECONSTRUCTION FOR LOOK DIRECTION MISMATCH R. Mallipeddi 1, J. P. Lie 2, S.
More informationQuality Report Generated with Pix4Dmapper Pro version
Quality Report Generated with Pix4Dmapper Pro version 3.0.13 Important: Click on the different icons for: Help to analyze the results in the Quality Report Additional information about the sections Click
More informationGerman LOFAR a new era in radio astronomy
German LOFAR a new era in radio astronomy Marcus Brüggen International University Bremen Rainer Beck (MPIfR) Heino Falcke (ASTRON) Anton Zensus (MPIfR) Köln, 28. Sep 2005 Outline Why LOFAR? What is happening
More informationarxiv: v1 [hep-lat] 7 Oct 2010
arxiv:.486v [hep-lat] 7 Oct 2 Nuno Cardoso CFTP, Instituto Superior Técnico E-mail: nunocardoso@cftp.ist.utl.pt Pedro Bicudo CFTP, Instituto Superior Técnico E-mail: bicudo@ist.utl.pt We discuss the CUDA
More informationPSEUDORANDOM numbers are very important in practice
Proceedings of the Federated Conference on Computer Science and Information Systems pp 571 578 ISBN 978-83-681-51-4 Parallel GPU-accelerated Recursion-based Generators of Pseudorandom Numbers Przemysław
More informationDirect Self-Consistent Field Computations on GPU Clusters
Direct Self-Consistent Field Computations on GPU Clusters Guochun Shi, Volodymyr Kindratenko National Center for Supercomputing Applications University of Illinois at UrbanaChampaign Ivan Ufimtsev, Todd
More informationLarge Field of View Radio Astronomy; relevant for many KSP s
Large Field of View Radio Astronomy; relevant for many KSP s SKA Key Science Drivers ORIGINS Probing the Dark Ages When & how were the first stars formed? Cosmology and Galaxy Evolution Galaxies, Dark
More informationParametric High Resolution Techniques for Radio Astronomical Imaging Chen Ben-David and Amir Leshem, Senior Member, IEEE
670 IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, VOL. 2, NO. 5, OCTOBER 2008 Parametric High Resolution Techniques for Radio Astronomical Imaging Chen Ben-David and Amir Leshem, Senior Member,
More informationADAPTIVE ANTENNAS. SPATIAL BF
ADAPTIVE ANTENNAS SPATIAL BF 1 1-Spatial reference BF -Spatial reference beamforming may not use of embedded training sequences. Instead, the directions of arrival (DoA) of the impinging waves are used
More informationA CUDA Solver for Helmholtz Equation
Journal of Computational Information Systems 11: 24 (2015) 7805 7812 Available at http://www.jofcis.com A CUDA Solver for Helmholtz Equation Mingming REN 1,2,, Xiaoguang LIU 1,2, Gang WANG 1,2 1 College
More informationComputation of Large Sparse Aggregated Areas for Analytic Database Queries
Computation of Large Sparse Aggregated Areas for Analytic Database Queries Steffen Wittmer Tobias Lauer Jedox AG Collaborators: Zurab Khadikov Alexander Haberstroh Peter Strohm Business Intelligence and
More informationMultivariate density estimation and its applications
Multivariate density estimation and its applications Wing Hung Wong Stanford University June 2014, Madison, Wisconsin. Conference in honor of the 80 th birthday of Professor Grace Wahba Lessons I learned
More informationSP-CNN: A Scalable and Programmable CNN-based Accelerator. Dilan Manatunga Dr. Hyesoon Kim Dr. Saibal Mukhopadhyay
SP-CNN: A Scalable and Programmable CNN-based Accelerator Dilan Manatunga Dr. Hyesoon Kim Dr. Saibal Mukhopadhyay Motivation Power is a first-order design constraint, especially for embedded devices. Certain
More informationBehavioral Simulations in MapReduce
Behavioral Simulations in MapReduce Guozhang Wang, Marcos Vaz Salles, Benjamin Sowell, Xun Wang, Tuan Cao, Alan Demers, Johannes Gehrke, Walker White Cornell University 1 What are Behavioral Simulations?
More informationCosmology with Galaxy Clusters: Observations meet High-Performance-Computing
Cosmology with Galaxy Clusters: Observations meet High-Performance-Computing Julian Merten (ITA/ZAH) Clusters of galaxies GPU lensing codes Abell 2744 CLASH: A HST/MCT programme Clusters of galaxies DM
More informationDetermining the Specification of an Aperture Array for Cosmological Surveys
Determining the Specification of an Aperture Array for Cosmological Surveys P. Alexander University of Cambridge, Department of Physics, Cavendish Laboratory, Cambridge, UK. A. J. aulkner Jodrell Bank
More informationMulti-frequency imaging of Cygnus A with LOFAR
Netherlands Institute for Radio Astronomy Multi-frequency imaging of Cygnus A with LOFAR John McKean (ASTRON) and all the imaging busy week team! ASTRON is part of the Netherlands Organisation for Scientific
More informationMachine Learning for Gravitational Wave signals classification in LIGO and Virgo
Machine Learning for Gravitational Wave signals classification in LIGO and Virgo Elena Cuoco European Gravitational Observatory www.elenacuoco.com @elenacuoco 2 About me About me Working as Data Analyst
More informationApplications of Robust Optimization in Signal Processing: Beamforming and Power Control Fall 2012
Applications of Robust Optimization in Signal Processing: Beamforg and Power Control Fall 2012 Instructor: Farid Alizadeh Scribe: Shunqiao Sun 12/09/2012 1 Overview In this presentation, we study the applications
More informationScalable and Power-Efficient Data Mining Kernels
Scalable and Power-Efficient Data Mining Kernels Alok Choudhary, John G. Searle Professor Dept. of Electrical Engineering and Computer Science and Professor, Kellogg School of Management Director of the
More informationTR A Comparison of the Performance of SaP::GPU and Intel s Math Kernel Library (MKL) for Solving Dense Banded Linear Systems
TR-0-07 A Comparison of the Performance of ::GPU and Intel s Math Kernel Library (MKL) for Solving Dense Banded Linear Systems Ang Li, Omkar Deshmukh, Radu Serban, Dan Negrut May, 0 Abstract ::GPU is a
More informationImplementing NNLO into MCFM
Implementing NNLO into MCFM Downloadable from mcfm.fnal.gov A Multi-Threaded Version of MCFM, J.M. Campbell, R.K. Ellis, W. Giele, 2015 Higgs boson production in association with a jet at NNLO using jettiness
More informationSquare Kilometre Array Science Data Challenge 1
Square Kilometre Array Science Data Challenge 1 arxiv:1811.10454v1 [astro-ph.im] 26 Nov 2018 Anna Bonaldi & Robert Braun, for the SKAO Science Team SKA Organization, Jodrell Bank, Lower Withington, Macclesfield,
More informationWelcome to MCS 572. content and organization expectations of the course. definition and classification
Welcome to MCS 572 1 About the Course content and organization expectations of the course 2 Supercomputing definition and classification 3 Measuring Performance speedup and efficiency Amdahl s Law Gustafson
More informationHybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS
Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Jorge González-Domínguez*, Bertil Schmidt*, Jan C. Kässens**, Lars Wienbrandt** *Parallel and Distributed Architectures
More informationThe Square Kilometre Array & High speed data recording
The Square Kilometre Array & High speed Astrophysics Group, Cavendish Laboratory, University of Cambridge http://www.mrao.cam.ac.uk/ bn204/ CRISP Annual Meeting FORTH, Crete, Greece March 2012 Outline
More informationInitial deep LOFAR observations of Epoch of Reionization windows: I. The North Celestial Pole
Astronomy& Astrophysics manuscript no. ncp_eor c ESO 2013 January 11, 2013 Initial deep LOFAR observations of Epoch of Reionization windows: I. The North Celestial Pole S. Yatawatta 2, A. G. de Bruyn 1,
More informationNear Optimal Adaptive Robust Beamforming
Near Optimal Adaptive Robust Beamforming The performance degradation in traditional adaptive beamformers can be attributed to the imprecise knowledge of the array steering vector and inaccurate estimation
More informationPorting a sphere optimization program from LAPACK to ScaLAPACK
Porting a sphere optimization program from LAPACK to ScaLAPACK Mathematical Sciences Institute, Australian National University. For presentation at Computational Techniques and Applications Conference
More informationExascale Computing for Radio Astronomy: GPU or FPGA?
Exascale Computing for Radio Astronomy: GPU or FPGA? Kees van Berkel MPSoC 2016, Nara, Japan 2016, July 14 Radio Astronomy: Herculus A (a.k.a. 3C 348) optically invisible jets, one-and-a-half million light-years
More informationShortest Lattice Vector Enumeration on Graphics Cards
Shortest Lattice Vector Enumeration on Graphics Cards Jens Hermans 1 Michael Schneider 2 Fréderik Vercauteren 1 Johannes Buchmann 2 Bart Preneel 1 1 K.U.Leuven 2 TU Darmstadt SHARCS - 10 September 2009
More informationParallel Longest Common Subsequence using Graphics Hardware
Parallel Longest Common Subsequence using Graphics Hardware John Kloetzli rian Strege Jonathan Decker Dr. Marc Olano Presented by: rian Strege 1 Overview Introduction Problem Statement ackground and Related
More informationA model leading to self-consistent iteration computation with need for HP LA (e.g, diagonalization and orthogonalization)
A model leading to self-consistent iteration computation with need for HP LA (e.g, diagonalization and orthogonalization) Schodinger equation: Hψ = Eψ Choose a basis set of wave functions Two cases: Orthonormal
More informationDr. Andrea Bocci. Using GPUs to Accelerate Online Event Reconstruction. at the Large Hadron Collider. Applied Physicist
Using GPUs to Accelerate Online Event Reconstruction at the Large Hadron Collider Dr. Andrea Bocci Applied Physicist On behalf of the CMS Collaboration Discover CERN Inside the Large Hadron Collider at
More informationNon-Closing Offsets on the VLA. R. C. Walker National Radio Astronomy Observatory Charlottesville VA.
VLA SCIENTIFIC MEMORANDUM NO. 152 Non-Closing Offsets on the VLA R. C. Walker National Radio Astronomy Observatory Charlottesville VA. March 1984 Recent efforts to obtain very high dynamic range in VLA
More informationSunrise: Patrik Jonsson. Panchromatic SED Models of Simulated Galaxies. Lecture 2: Working with Sunrise. Harvard-Smithsonian Center for Astrophysics
Sunrise: Panchromatic SED Models of Simulated Galaxies Lecture 2: Working with Sunrise Patrik Jonsson Harvard-Smithsonian Center for Astrophysics Lecture outline Lecture 1: Why Sunrise? What does it do?
More informationDeconvolving Primary Beam Patterns from SKA Images
SKA memo 103, 14 aug 2008 Deconvolving Primary Beam Patterns from SKA Images Melvyn Wright & Stuartt Corder University of California, Berkeley, & Caltech, Pasadena, CA. ABSTRACT In this memo we present
More informationAdaptive beamforming for uniform linear arrays with unknown mutual coupling. IEEE Antennas and Wireless Propagation Letters.
Title Adaptive beamforming for uniform linear arrays with unknown mutual coupling Author(s) Liao, B; Chan, SC Citation IEEE Antennas And Wireless Propagation Letters, 2012, v. 11, p. 464-467 Issued Date
More informationThe Lattice Boltzmann Simulation on Multi-GPU Systems
The Lattice Boltzmann Simulation on Multi-GPU Systems Thor Kristian Valderhaug Master of Science in Computer Science Submission date: June 2011 Supervisor: Anne Cathrine Elster, IDI Norwegian University
More informationStochastic Modelling of Electron Transport on different HPC architectures
Stochastic Modelling of Electron Transport on different HPC architectures www.hp-see.eu E. Atanassov, T. Gurov, A. Karaivan ova Institute of Information and Communication Technologies Bulgarian Academy
More informationVery Large Array Sky Survey
1 Very Large Array Sky Survey Technical Implementation & Test Plan Casey Law, UC Berkeley Galactic Center (Survey) Multiwavelength Image Credit: X-ray: NASA/UMass/D.Wang et al., Radio: N RAO/AUI/NSF/NRL/N.Kassim,
More informationClaude Tadonki. MINES ParisTech PSL Research University Centre de Recherche Informatique
Claude Tadonki MINES ParisTech PSL Research University Centre de Recherche Informatique claude.tadonki@mines-paristech.fr Monthly CRI Seminar MINES ParisTech - CRI June 06, 2016, Fontainebleau (France)
More informationHeterogeneous programming for hybrid CPU-GPU systems: Lessons learned from computational chemistry
Heterogeneous programming for hybrid CPU-GPU systems: Lessons learned from computational chemistry and Eugene DePrince Argonne National Laboratory (LCF and CNM) (Eugene moved to Georgia Tech last week)
More informationSPARSE SOLVERS POISSON EQUATION. Margreet Nool. November 9, 2015 FOR THE. CWI, Multiscale Dynamics
SPARSE SOLVERS FOR THE POISSON EQUATION Margreet Nool CWI, Multiscale Dynamics November 9, 2015 OUTLINE OF THIS TALK 1 FISHPACK, LAPACK, PARDISO 2 SYSTEM OVERVIEW OF CARTESIUS 3 POISSON EQUATION 4 SOLVERS
More informationGPU acceleration of Newton s method for large systems of polynomial equations in double double and quad double arithmetic
GPU acceleration of Newton s method for large systems of polynomial equations in double double and quad double arithmetic Jan Verschelde joint work with Xiangcheng Yu University of Illinois at Chicago
More informationStudy of Large-Scale Galactic Magnetic Fields at Low Frequencies. Jana Köhler - MPIfR -
Study of Large-Scale Galactic Magnetic Fields at Low Frequencies Jana Köhler - MPIfR - How to measure Magnetic Fields??? How to measure Galactic Magnetic Field? Linear Polarization of Starlight product
More informationDetection and Localization of Tones and Pulses using an Uncalibrated Array
Detection and Localization of Tones and Pulses using an Uncalibrated Array Steven W. Ellingson January 24, 2002 Contents 1 Introduction 2 2 Traditional Method (BF) 2 3 Proposed Method Version 1 (FXE) 3
More informationOn Portability, Performance and Scalability of a MPI OpenCL Lattice Boltzmann Code
On Portability, Performance and Scalability of a MPI OpenCL Lattice Boltzmann Code E Calore, S F Schifano, R Tripiccione Enrico Calore INFN Ferrara, Italy 7 th Workshop on UnConventional High Performance
More informationA Blackbox Polynomial System Solver on Parallel Shared Memory Computers
A Blackbox Polynomial System Solver on Parallel Shared Memory Computers Jan Verschelde University of Illinois at Chicago Department of Mathematics, Statistics, and Computer Science The 20th Workshop on
More informationClassification of Higgs Boson Tau-Tau decays using GPU accelerated Neural Networks
Classification of Higgs Boson Tau-Tau decays using GPU accelerated Neural Networks Mohit Shridhar Stanford University mohits@stanford.edu, mohit@u.nus.edu Abstract In particle physics, Higgs Boson to tau-tau
More informationForegrounds for EoR 21cm signals: their properties, effects and strategies to deal with them
Foregrounds for EoR 21cm signals: their properties, effects and strategies to deal with them Ger de Bruyn Groningen Reionization meeting, 29 jun 05 ASTRON, Dwingeloo & Kapteyn Institute, Groningen Outline
More informationNew Zealand Impacts on CSP and SDP Designs
New Zealand Impacts on CSP and SDP Designs Dr Andrew Ensor Director HPC Research Laboratory/ Director NZ SKA Alliance Mid-Year Mini SKA Colloquium 2018 12 July 2018 Past, Present, Future Mega-Science Projects
More informationGPU Acceleration of Cutoff Pair Potentials for Molecular Modeling Applications
GPU Acceleration of Cutoff Pair Potentials for Molecular Modeling Applications Christopher Rodrigues, David J. Hardy, John E. Stone, Klaus Schulten, Wen-Mei W. Hwu University of Illinois at Urbana-Champaign
More informationThe high angular resolution component of the SKA
The high angular resolution component of the SKA Tingay, S. 1, Bignall, H. 1, Colegate, T. 1, Aben, G. 2, Nicolls, J. 2 & Weston, S. 3 Prof. Steven Tingay SKA2010 Deputy Director, ICRAR March 22 25, 2010
More informationExtracting a signal from the
Extracting a signal from the epoch of reionization with LOFAR Geraint Harker Kapteyn Institute, Groningen, the Netherlands Collaborators: Ger de Bruyn; Michiel Brentjens, Leon Koopmans, Saleem Zaroubi;
More informationPDF hosted at the Radboud Repository of the Radboud University Nijmegen
PDF hosted at the Radboud Repository of the Radboud University Nijmegen The following full text is a preprint version which may differ from the publisher's version. For additional information about this
More informationDelayed and Higher-Order Transfer Entropy
Delayed and Higher-Order Transfer Entropy Michael Hansen (April 23, 2011) Background Transfer entropy (TE) is an information-theoretic measure of directed information flow introduced by Thomas Schreiber
More informationIntroduction to numerical computations on the GPU
Introduction to numerical computations on the GPU Lucian Covaci http://lucian.covaci.org/cuda.pdf Tuesday 1 November 11 1 2 Outline: NVIDIA Tesla and Geforce video cards: architecture CUDA - C: programming
More informationS XMP LIBRARY INTERNALS. Niall Emmart University of Massachusetts. Follow on to S6151 XMP: An NVIDIA CUDA Accelerated Big Integer Library
S6349 - XMP LIBRARY INTERNALS Niall Emmart University of Massachusetts Follow on to S6151 XMP: An NVIDIA CUDA Accelerated Big Integer Library High Performance Modular Exponentiation A^K mod P Where A,
More informationAlgorithm for Sparse Approximate Inverse Preconditioners in the Conjugate Gradient Method
Algorithm for Sparse Approximate Inverse Preconditioners in the Conjugate Gradient Method Ilya B. Labutin A.A. Trofimuk Institute of Petroleum Geology and Geophysics SB RAS, 3, acad. Koptyug Ave., Novosibirsk
More informationECE 571 Advanced Microprocessor-Based Design Lecture 9
ECE 571 Advanced Microprocessor-Based Design Lecture 9 Vince Weaver http://web.eece.maine.edu/~vweaver vincent.weaver@maine.edu 20 February 2018 Announcements HW#4 was posted. About branch predictors Don
More informationEnergy-efficient Mapping of Big Data Workflows under Deadline Constraints
Energy-efficient Mapping of Big Data Workflows under Deadline Constraints Presenter: Tong Shu Authors: Tong Shu and Prof. Chase Q. Wu Big Data Center Department of Computer Science New Jersey Institute
More informationImaging, Deconvolution & Image Analysis I. Theory. (IRAM/Obs. de Paris) 7 th IRAM Millimeter Interferometry School Oct. 4 - Oct.
Imaging, Deconvolution & Image Analysis I. Theory Jérôme PETY (IRAM/Obs. de Paris) 7 th IRAM Millimeter Interferometry School Oct. 4 - Oct. 8 2010, Grenoble Scientific Analysis of a mm Interferometer Output
More informationMonte Carlo Methods for Electron Transport: Scalability Study
Monte Carlo Methods for Electron Transport: Scalability Study www.hp-see.eu Aneta Karaivanova (Joint work with E. Atanassov and T. Gurov) Institute of Information and Communication Technologies Bulgarian
More informationPulsars with LOFAR The Low-Frequency Array
Pulsars with LOFAR The Low-Frequency Array Ben Stappers ASTRON, Dwingeloo With assistance from Jason Hessels,, Michael Kramer, Joeri van Leeuwen and Dan Stinebring. Next generation radio telescope Telescope
More informationMayer Sampling Monte Carlo Calculation of Virial. Coefficients on Graphics Processors
Mayer Sampling Monte Carlo Calculation of Virial Coefficients on Graphics Processors Andrew J. Schultz, Vipin Chaudhary, David A. Kofke,, and Nathaniel S. Barlow Department of Chemical and Biological Engineering,
More informationOn the Use of a Many core Processor for Computational Fluid Dynamics Simulations
On the Use of a Many core Processor for Computational Fluid Dynamics Simulations Sebastian Raase, Tomas Nordström Halmstad University, Sweden {sebastian.raase,tomas.nordstrom} @ hh.se Preface based on
More information