Data analysis of massive data sets a Planck example

Similar documents
Big Bang, Big Iron: CMB Data Analysis at the Petascale and Beyond

Planck. Report on the status of the mission Carlo Baccigalupi, SISSA

Science exploitation of CMB data. Paolo Natoli Università di Ferrara and INFN on behalf of the Italian CMB community

Physics of CMB Polarization and Its Measurement

Planck 2014 The Microwave Sky in Temperature and Polarisation Ferrara, 1 5 December The Planck mission

Forthcoming CMB experiments and expectations for dark energy. Carlo Baccigalupi

The CMB sky observed at 43 and 95 GHz with QUIET

LFI frequency maps: data analysis, results and future challenges

Cosmology & CMB. Set5: Data Analysis. Davide Maino

THE PLANCK MISSION The most accurate measurement of the oldest electromagnetic radiation in the Universe

The first light in the universe

Cosmology Large Angular Scale Surveyor. Wednesday, September 25, 13

CMB studies with Planck

Constraints on inflationary models of the Universe based on CMB data

The international scenario Balloons, LiteBIRD, PIXIE, Millimetron

Power spectrum exercise

The microwave sky as seen by Planck

Microwave Background Polarization: Theoretical Perspectives

Planck 2015 parameter constraints

What I heard about Planck Eiichiro Komatsu (MPA) December 9, 2014 December 22, 2014

arxiv: v1 [astro-ph.co] 3 Feb 2016

CMB polarization and cosmology

Measurements of Degree-Scale B-mode Polarization with the BICEP/Keck Experiments at South Pole

Cosmology with CMB: the perturbed universe

Short course: 101 level introductory course to provide a foundation for everyone coming to the workshop to understand the field.

Testing parity violation with the CMB

Examining the Effect of the Map-Making Algorithm on Observed Power Asymmetry in WMAP Data

PLANCK lately and beyond

Planck and Virtual Observatories: Far Infra-red / Sub-mm Specificities

Gravitational Lensing of the CMB

Observational Cosmology

Precise measurement of CMB polarisation from Dome-C: the BRAIN and CLOVER experiments

Instrumental Systematics on Lensing Reconstruction and primordial CMB B-mode Diagnostics. Speaker: Meng Su. Harvard University

Really, really, what universe do we live in?

SPIDER: A Balloon-Borne Polarimeter for Measuring Large Angular Scale CMB B-modes

Michel Piat for the BRAIN collaboration

Cosmology with CMB & LSS:

Toward an Understanding of Foregrounds in the BICEP2 Region

Exploring the primordial Universe with QUBIC

ALMA Development Program

Herschel and Planck: ESA s New Astronomy Missions an introduction. Martin Kessler Schloss Braunshardt 19/03/2009

Planck. Ken Ganga. APC/CNRS/ University of Paris-Diderot

Introduction to COSMOMC

The ultimate measurement of the CMB temperature anisotropy field UNVEILING THE CMB SKY

QUaD and Development for Future CMB Experiments

PILOT. A far-infrared balloon-borne polarization experiment. Jonathan Aumont IRAP Toulouse, France

Using training sets and SVD to separate global 21-cm signal from foreground and instrument systematics

Cosmic Microwave Background

OVERVIEW OF NEW CMB RESULTS

Challenges in Cosmic Microwave Background Data Analysis

Anisotropy in the CMB

arxiv:astro-ph/ v1 9 Dec 2001

News from BICEP/Keck Array CMB telescopes

Primordial B-modes: Foreground modelling and constraints

Consistency tests for Planck and WMAP in the low multipole domain

Constraining the topology of the Universe using CMB maps

The Quest for Gravity Wave B-modes

Primordial gravitational waves detected? Atsushi Taruya

DES Galaxy Clusters x Planck SZ Map. ASTR 448 Kuang Wei Nov 27

Cosmic Microwave Background

Algorithms for Higher Order Spatial Statistics

The Millennium Simulation: cosmic evolution in a supercomputer. Simon White Max Planck Institute for Astrophysics

CMB interferometry (20 April 2012)

CMB Polarization and Cosmology

Primordial Gravitational waves and the polarization of the CMB. José Alberto Rubiño Martín (IAC, Tenerife)

QUIET Experiment and HEMT receiver array

CMB Polarization Experiments: Status and Prospects. Kuo Assistant Professor of Physics Stanford University, SLAC

Polarization of the Cosmic Microwave Background: Are Those Guys Serious? Al Kogut Goddard Space Flight Center

Imprint of Scalar Dark Energy on CMB polarization

Polarised synchrotron simulations for EoR experiments

Neutrinos in the era of precision Cosmology

Big-Data as a Challenge for Astrophysics

Galaxies 626. Lecture 3: From the CMBR to the first star

Statistical inversion of the LOFAR Epoch of Reionization experiment data model

Cosmology after Planck

Constraints on primordial abundances and neutron life-time from CMB

arxiv: v1 [astro-ph.co] 8 Oct 2010

Constraining the redshift evolution of the Cosmic Microwave Background black-body temperature with PLANCK data

COSMIC MICROWAVE BACKGROUND ANISOTROPIES

D une masse de données à une poignée de paramètres cosmologiques: quelques aperçus sur le traitement des données du satellite Planck.

FOUR-YEAR COBE 1 DMR COSMIC MICROWAVE BACKGROUND OBSERVATIONS: MAPS AND BASIC RESULTS

Planck was conceived to confirm the robustness of the ΛCDM concordance model when the relevant quantities are measured with much higher accuracy

Foregrounds for observations of the high redshift global 21 cm signal

CMB Lensing Reconstruction on PLANCK simulated data

Optimizing CMB Polarization Experiments to Constrain Inflationary Physics

Gravitational Waves and the Microwave Background

The cosmic background radiation II: The WMAP results. Alexander Schmah

INDEPENDENT COMPONENT ANALYSIS OF THE COSMIC MICROWAVE BACKGROUND

Delensing CMB B-modes: results from SPT.

CMB Foreground and separation methods. Astro 448 Yuxi Zhao 12/5/2018

Probing Early Universe Physics with Observations of the Cosmic Microwave Background. Sarah Church KIPAC, Stanford University

MoBiKID Kinetic Inductance Detectors for up-coming B-mode satellite experiments

Cosmological Studies with SZE-determined Peculiar Velocities. Sarah Church Stanford University

An Introduction to Radio Astronomy

Multiphysics Simulation of Polymer-Based Filters for Sub-Millimetre Space Optics

Cosmic Microwave Background Anisotropy

4.3 Planck. Scientific goals. Planck payload

PoS(MULTIF2017)017. Highlights of Planck results

Wavelets Applied to CMB Analysis

PICO - Probe of Inflation and Cosmic Origins

Transcription:

Data analysis of massive data sets a Planck example Radek Stompor (APC) LOFAR workshop, Meudon, 29/03/06

Outline 1. Planck mission; 2. Planck data set; 3. Planck data analysis plan and challenges; 4. Planck vs interferometers; 5. CMB work at APC; 6. Conclusions. 2

The Planck Satellite An ESA mission, with a significant contribution from NASA, due to launch in fall 2007. Will carry on board 72 detectors shared between two High (46) and Low Frequency (26) Instruments. An 12+ month all-sky survey at 9 microwave frequencies from 30 to 857 GHz. O(10 12 ) observations; O(10 8 ) sky pixels; O(10 3 ) spectral multipoles. 3

The Planck data set - the sizes Raw data i. detector readouts; ii. pointing information; iii. household data; all amounting to roughly ~ 2-5 Tbytes of the data. Derived products (including mission deliverables) i. systematic templates, noise filters, etc ii. single frequency maps; iii. foreground component maps; iv. power spectra and their covariances; v. cosmological parameter likelihoods, etc. Temporary objects i. simulated data corresponding to various stages of the analysis all amounting to roughly 100 x raw data 200-500 Tbytes of the data stored and in a need of frequent processing at a peak of the data analysis activities. 4

The Planck data set - the characteristics Mostly low signal-to-noise data (100 uk sec 1500 uk) with a trend set by a cosmological dipole ( ~3mK) and occasional high signal nearly periodic intervals corresponding to the Galactic plane crossing; Properties slowly (i.e., piece-wise stationary) evolving with time and have to be derived directly from the data themselves; i. instrumental noise: piece-wise stationary with 1/f correlations; ii. responses, etc. Uniform sampling ( ~ 5 msec for HFI detectors), but cosmic rays hits, planet crossings, other transients and telemetry drop-outs; Systematic effects (temperature drifts etc) common to most of the bolometric detectors; Two types of the detectors (LFI radiometers and HFI bolometers); Simple scanning strategy. Planck will produce not only large but also complex data set to be analyzed. 5

The Planck data set - the context Probably the largest (or one of) CMB data sets available at the time; Almost two orders of magnitude larger than the WMAP data set ( ~50 Gbytes) and significantly more complex); Yet to be superseded soon by next generation of CMB experiments aiming at the detection and and characterization of the B-mode polarization and thus required to produce at least 2-3 orders of more data than Planck. Planck marks a transition in the CMB data analysis from interactive and subjective to automatic and objective. 6

CMB Data Analysis - I 7

CMB Data Analysis - II standard way: TOD Maps Power Spectra Cosmo Params + all the rest. Planck way: Maps TOD TOD level simulations Power Spectra Cosmo Params + all the rest. 8

CMB Data Analysis - III Approximate analysis algorithm kernels are : i. FFT in time-domain - N t log 2 N t ii. SHT in pixel domain - N p 3/2 Implicit multiplications and/or inversions of a rank O( 10 8 ). i. PCG solvers [O(10 2 )]; ii. Sparse matrix algebra; iii. Monte Carlo realizations [O(10 4 )]. For Planck : N t ~ 5 x 10 11 (72 detectors, 12 months) N p ~ 3 x 5 x 10 7 N l ~ 6 x 3 x 10 3 (i, q & u maps) (TT, TE, EE, BB, TB & EB spectra) 9

Planck computational resource forecasting O(10-100) Exaflop of total processing capacity (per year); O(10) TB of total memory; O(10) GB of memory per processor; Fast inter-processor and inter-nodes communication (latency less of an issue); O(100) TB of archival file storage for primary data and derived data products; O(10) TB of scratch file storage for temporary products; O(1-10) GB of local tmp storage on each processor or node for out-of-core calculations and internal code products; General single file system; fast, scalable I/O suitable for reading and writing of huge data sets, and allowing for a simultaneous disc access of multiple massively parallel I/O-heavy applications; Special hardware facilitating fast Fourier and spherical harmonic transforms; Fast (possibly Gigabit or better) network connections with DPC computers, and fast connection with other supercomputing centers involved in the Planck effort. 10

CMB Data Analysis - IV Planck data analysis will require O(10 19 ) operations - 1 ExaFlop (10 million Tflop) - or more! Planck marks a transition in the CMB data analysis from serial to parallel on basically all levels of the data processing. 11

Time domain data analysis tasks Goal: to create and test a data model and use it to estimate the sky signals (maps) Pointing reconstruction; Noise estimation and characterization; i. Gaussianity/stationarity tests; ii. joint frequency-time analysis, etc Deglitching; Instrumental effects (de-)convolution; Calibration: i. detector responsivity; ii. bolometric constant determination; ii. beam reconstruction. Systematic effects reconstruction and modelling ; Map-making and some of component separations (time domain based) techniques; Time-domain based point source extraction techniques; Map error estimation; Time domain based simulations. 12

Pixel domain tasks Image processing/filtering/smoothing; Pixel-based component separation (e.g., ICA based methods); Pixel domain point source subtraction; Non-Gaussianity tests; Power spectrum estimation: i. monte carlo (pseudo-cls) techniques; ii. bayesian techniques: a. unwanted template marginalization; iii. hybrid approaches.. 13

Planck data analysis highlights Computationally very heavy; Sophisticated, novel signal, image and data processing techniques; Diverse and heterogeneous; Parallel, very efficient, high performance numerical tools and implementations. 14

Total power vs. interferometric experiments Radiometer based interferometers mean simpler time domain data and processing (at least in CMB, e.g., DASI and CBI); The total power techniques (like these developed for Planck) have often straightforward applications for the interferometers, where modes of the (u,v) plane replace the sky pixels, e.g., DASI: in particular, i. map-making techniques; ii. power spectrum estimators (with fancy template marginalizations); iii. component separation/point source extraction algorithms; iv. non-gaussianity tools.. 15

Planck/CMB data analysis at APC ~10 researchers plus students and (software) engineers; with interests and an active involvement in different (and basically all) aspects of the data analysis: 1. infrastructure (data bases, I/O environments, etc); 2. Time domain processing; 3. Map-making, image processing, component separation, etc. 4. Power spectrum estimators and non-gaussianity tests. And their scientific exploitation. Involved in a number of past (e.g., Boomerang, MAXIMA, Archeops), current (e.g., QUAD, Bicep, APEX) and future (Planck, Brain, EBEx ) CMB experiments. 16

Conclusions The Planck data analysis will produce a legacy of an entire slew of methods, algorithms, their implementations and high performance software pieces which can be directly for some of the scientific goals of the LOFAR project. A natural synergy seems to possible between the data analysis pipeline and software development for the Epoch Of Reionization project of LOFAR and Planck. This work in the context of LOFAR could potentially benefit from the Planck DA experiences. Planck will be a WMAP of the E-mode pinning down all the details of the E-mode reionization bump down to sampling variance limit. A potential synergy between the science goals and science exploitation of the Planck and LOFAR data sets aiming at constraining the nature and details of the cosmological reionization. 17