Astro-Informatics: Computation in the study of the Universe

Similar documents
Real Astronomy from Virtual Observatories

Doing astronomy with SDSS from your armchair

RLW paper titles:

Astroinformatics in the data-driven Astronomy

Modern Image Processing Techniques in Astronomical Sky Surveys

An intelligent client application for on-line astronomical information

Lecture Outline: Chapter 5: Telescopes

Rick Ebert & Joseph Mazzarella For the NED Team. Big Data Task Force NASA, Ames Research Center 2016 September 28-30

The INES Archive in the era of Virtual Observatories. E. Solano Spanish Virtual Observatory LAEX CAB / INTA-CSIC

Scientific Data Flood. Large Science Project. Pipeline

Multi-wavelength Astronomy

Lecture Fall, 2005 Astronomy 110 1

Clustering of Galaxies in an Expanding Universe IAS School January 26, 2012 Nanyang Technological University Singapore

PHY 475/375. Lecture 2. (March 28, 2012) The Scale of the Universe: The Shapley-Curtis Debate

D4.2. First release of on-line science-oriented tutorials

Supporting Collaboration for the Gravitational Wave Astronomy Community with InCommon

Astronomical Notes. Astronomische Nachrichten. A machine learning classification broker for the LSST transient database

New physics is learnt from extreme or fundamental things

The Universe in the Cloud. Darren Croton Centre for Astrophysics and Supercomputing Swinburne University

9.1 Large Scale Structure: Basic Observations and Redshift Surveys

Yuji Shirasaki. National Astronomical Observatory of Japan. 2010/10/13 Shanghai, China

ASTR 2310: Chapter 6

Review of Lecture 15 3/17/10. Lecture 15: Dark Matter and the Cosmic Web (plus Gamma Ray Bursts) Prof. Tom Megeath

Astronomy 422. Lecture 15: Expansion and Large Scale Structure of the Universe

ESASky, ESA s new open-science portal for ESA space astronomy missions

Large Scale Structure of the Universe Lab

HOST: Welcome to Astronomy Behind the Headlines, a podcast by the Astronomical

Galaxies. Introduction. Different Types of Galaxy. Teacher s Notes. Shape. 1. Download these notes at

Dark Energy in Our Expanding Universe Joe Mohr Department of Astronomy Department of Physics NCSA

Chapter 5: Telescopes

Astronomy 113. Dr. Joseph E. Pesce, Ph.D. Distances & the Milky Way. The Curtis View. Our Galaxy. The Shapley View 3/27/18

Astronomy 113. Dr. Joseph E. Pesce, Ph.D. Dr. Joseph E. Pesce, Ph.D.

Galaxy formation and evolution. Astro 850

Lecture 11: SDSS Sources at Other Wavelengths: From X rays to radio. Astr 598: Astronomy with SDSS

Galaxy Growth and Classification

ASTERICS Astronomy ESFRI & Research Infrastructure Cluster

Introduction to the Sloan Survey

Homework #7: Properties of Galaxies in the Hubble Deep Field Name: Due: Friday, October points Profs. Rieke

Earth Science Lesson Plan Quarter 4, Week 10, Day 1

[FILE] MILKY WAY AT HOME EBOOK

Lecture Outlines. Chapter 25. Astronomy Today 7th Edition Chaisson/McMillan Pearson Education, Inc.

Science in the Virtual Observatory

AS1001:Extra-Galactic Astronomy. Lecture 3: Galaxy Fundamentals

The Cosmological Redshift. Cepheid Variables. Hubble s Diagram

Machine Learning Applications in Astronomy

A Library of the X-ray Universe: Generating the XMM-Newton Source Catalogues

Chapter 5 Telescopes

Observational Astronomy

Stellar Astronomy 1401 Spring 2009

Current Status of Chinese Virtual Observatory


Challenges of Predictive 3D Astrophysical Simulations of Accreting Systems. John F. Hawley Department of Astronomy, University of Virginia

Department of Astrophysical Sciences Peyton Hall Princeton, New Jersey Telephone: (609)

Light and Telescope 10/24/2018. PHYS 1403 Introduction to Astronomy. Reminder/Announcement. Chapter Outline. Chapter Outline (continued)

Collecting Light. In a dark-adapted eye, the iris is fully open and the pupil has a diameter of about 7 mm. pupil

Euro-VO. F. Genova, Interoperability meeting, 9 November 2009

NASA/IPAC EXTRAGALACTIC DATABASE

The World Bank and the Open Geospatial Web. Chris Holmes

TWO SMALL PIECES OF GLASS A Space Science Program for Grades 5-12

Virtual Observatories and Developing Countries

FIVE FUNDED* RESEARCH POSITIONS

Homework on Properties of Galaxies in the Hubble Deep Field Name: Due: Friday, April 8 30 points Prof. Rieke & TA Melissa Halford

Lab 1: Measurement Errors Adapted from Holtzman's Intro Lab for Astr110

A SPEctra Clustering Tool for the exploration of large spectroscopic surveys. Philipp Schalldach (HU Berlin & TLS Tautenburg, Germany)

BAYESIAN CROSS-IDENTIFICATION IN ASTRONOMY

Foundations of Astronomy 13e Seeds. Chapter 6. Light and Telescopes

X-ray Astronomy F R O M V - R O CKETS TO AT HENA MISSION. Thanassis Akylas

Data challenges of the Virtual Observatory in Time Domain Astronomy

Astronomical imagers. ASTR320 Monday February 18, 2019

Chandra: Revolution through Resolution. Martin Elvis, Chandra X-ray Center

Hubblecast Episode 91: What does the future hold for Hubble? Part I. Visual notes

The Secrets of Galaxies. Student s Guide Advanced Level CESAR s Science Case

Writing very large numbers

PHY323:Lecture 7 Dark Matter with Gravitational Lensing

Renewal Strings for Cleaning Astronomical Databases

Clusters: Observations

Astronomy of the Next Decade: From Photons to Petabytes. R. Chris Smith AURA Observatory in Chile CTIO/Gemini/SOAR/LSST

Summary. Week 7: 10/5 & 10/ Learning from Light. What are the three basic types of spectra? Three Types of Spectra

The SDSS Data. Processing the Data

2. OBSERVATIONAL COSMOLOGY

COMA CLUSTER OF GALAXIES

1 Lecture, 2 September 1999

Tools of Modern Astronomy

A.C.R.E and. C3S Data Rescue Capacity Building Workshops. December 4-8, 2017 Auckland, New Zealand. Session 3: Rescue of Large Format and Analog Data

Today: Start Ch. 18: Cosmology. Homework # 5 due next Wed. (HW #6 is online)

Galaxies Guiding Questions

Lecture 25 The Milky Way Galaxy November 29, 2017

The NASA/IPAC Infrared Science Archive (IRSA): The Demo

Problem Solving. radians. 180 radians Stars & Elementary Astrophysics: Introduction Press F1 for Help 41. f s. picture. equation.

Big Data Era in Astronomical Educational Activities in Armenia

Lab 1: Dark Matter in Galaxy Clusters Dynamical Masses, Strong Lensing

Learning algorithms at the service of WISE survey

Astro-lab at the Landessternwarte Heidelberg. Overview astro-lab & introduction to tasks. Overview astro-lab

Todays Topics 3/19/2018. Light and Telescope. PHYS 1403 Introduction to Astronomy. CCD Camera Makes Digital Images. Astronomical Detectors

the open-source sky survey David W. Hogg (NYU)

The 2dF Galaxy Redshift Survey Matthew Colless. Encyclopedia of Astronomy & Astrophysics P. Murdin

ASTRONOMY. Chapter 5 RADIATION AND SPECTRA PowerPoint Image Slideshow

Astroinformatics: massive data research in Astronomy Kirk Borne Dept of Computational & Data Sciences George Mason University

Lecture 14: Other Galaxies A2020 Prof. Tom Megeath. The Milky Way in the Infrared 3/17/10. NGC 7331: the Milky Way s Twins. Spiral Galaxy bulge halo

[ [ CALENDARS AND CONSTELLATIONS OF THE ANCIENT WORLD BY

Transcription:

Astro-Informatics: Computation in the study of the Universe Bob Mann and Andy Lawrence Institute for Astronomy, School of Physics (rgm@roe.ac.uk & al@roe.ac.uk)

Plan Computational Astrophysics N-body simulations of galaxy clustering Astro-Informatics Survey astronomy & the Virtual Observatory Discussion Astronomy and informatics

Plan Computational Astrophysics N-body simulations of galaxy clustering Astro-Informatics Survey astronomy & the Virtual Observatory Discussion Astronomy and informatics

Observing galaxy clustering 1930s: Hubble Galaxies aren t t uniformly distributed on sky 1950s: Shane and Wirtanen Map of galaxy distribution on the sky from counting 100,000 galaxies by eye (10 years!) 1980s: CfA Redshift Survey (Huchra,, Geller, de Lapparent) First sizeable 3D map of the local Universe Measured rough distances to ~11,000 galaxies

1985: first CfA survey 3D map of a pyramidal slice of space, projected into 2D ~500 million light years Rich structure walls, filaments, voids How to explain this richness of structure?

Modelling galaxy clustering Physics simple in Cold Dark Matter model Collisionless material moving under gravity Apply perturbation theory to density field Linear theory treatment simple, but Perturbations non-linear on scales of interest Fourier modes couple, analytic methods fail Need numerical simulations to model galaxy clustering into non-linear regime Set up test masses and evolve under gravity: i.e. gravitational N-body simulations

Two decades of N-body N simulations 1985: Davis, Efstathiou, Frenk,, White (32) 3 particles <10 particles per galaxy Early success for Cold Dark Matter model 2005: Virgo Consortium Inc. John Peacock (IfA( IfA), plus EPCC (2202) 3 particles ~1000 particles per galaxy Mass resolution increased by a factor of ~10 2 and simulation volume by a factor of ~10 3

Theory v Observation Theory: VIRGO Observation: 2dFGRS (inc. John Peacock) ~250,000 galaxies (SDSS: ~500,000 galaxies) Quantitative clustering analysis reveals theory and observation in excellent agreement

Galaxy clustering summary Cold Dark Matter model accounts for the observed clustering of galaxies Major triumph of modern astronomy Numerical simulations crucial, but this is astronomers using computers, not astronomers using computer science Are there examples of real interaction between astronomy & computer science? More interesting than just number-crunching?

Plan Computational Astrophysics N-body simulations of galaxy clustering Astro-Informatics Survey astronomy & the Virtual Observatory Discussion Astronomy and informatics

(M51 graphics from Jim Gray & Alex Szalay) Observational Astronomy Electromagnetic spectrum Multiwavelength view of a spiral galaxy ROSAT ~kev DSS Optical IRAS 25µ IRAS 100µ GB 6cm NVSS 20cm Different angular resolution of instruments Different physical emission mechanisms WENSS 92cm

Changes in the way that we make observations Old Style: Many small, specific programmes Astronomer proposes observations, goes to telescope, brings data home on tape, analyses data, publishes paper, puts tape in desk drawer and forgets about it New Style: Few large, multi-use use surveys Consortium designs survey to address many science goals, undertakes survey over several years, establishes database many people do different science with same data from DB

Trends behind these changes Instruments made easier to use & more effort put into data reduction software Easier to use data from new instrument Multiwavelength astronomy much easier Instruments are more sensitive and have more detector elements Can image large areas of sky quickly Survey mode of observation more efficient

Very strong local interest Wide Field Astronomy Unit Part of the UoE Institute for Astronomy Based at Royal Observatory Edinburgh, on Blackford Hill Two strands to WFAU work Curation of optical/near-infrared infrared sky surveys Helping build the global Virtual Observatory

The Virtual Observatory Goals Federate all the world s s astronomy data Provide resources for exploitation of data Challenges sociological & technical Heterogeneous, distributed datasets Lack of global schema; metadata often poor Legacy analysis codes in many languages Solution International collaboration Architecture built on web services

DB1 Schematic Virtual Observatory Registry DB2 Compute Resource DB3 DB4 User

WFAU s computational problems Quality Control Spatial Indexing Analysis close to DB Provenance Individual sky survey archives: scale Lack of Global Schema Query Language Difficulty in Making Joins Integration with the Literature Virtual Observatory: interoperability

Quality control: automated junk detection SuperCOSMOS Sky Survey Scans of photographic plates ~1800 plates cover whole sky Image analyser run over images ~250,000 sources per plate Classes of spurious source Trails: satellites, aeroplanes, Diffraction effects around bright stars How to find these spurious sources?

Quality control: automated junk detection (2) Junk found in unusual configurations Lines, circles: the eye spots them easily but can t t eyeball thousands of plates! Amos Storkey,, Chris Williams, Nigel Hambly Developed new generative method, based on unlikeliness of configurations

Analysing sky survey data WFAU has multi-tb sky survey databases Many analyses will use much of the data e.g. finding one-in in-a-million unusual objects e.g. quantifying properties of populations Users can t t download data to workstation WFAU must provide analysis services on DB Security issues if users upload their code Application of mobile code security work? discussion started with Don Sannella s group

Difficulty of matching entries between sky survey databases Angular resolution varies between datasets Matching by spatial proximity is inadequate

Difficulty of matching entries between sky survey databases (2) Probabilistic framework well established But need to know properties of source populations Often not the case Learn the probabilities for matching different classes of source iteratively (EM algorithm) Emma Taylor (PhD), with Amos Storkey & Chris Williams

Difficulty of matching entries between sky survey databases (3) Sophisticated matching algorithms are often computationally expensive Want to cache matches for re-use AstroDAS: : Diego Prina Ricotti, Raj Bose Distributed annotation server for astronomy Annotation Server Optical X-ray Radio

Integrating the online literature into the VO If we find an interesting object, we frequently want to ask questions like: What s s known about this area of sky? What s s known about objects like this? Have objects like this been reported before? Literature is too large to search manually Can text mining techniques help?

Integrating the online literature into the VO (2) AstroNER: : Named Entity Recognition Claire Grover, Ben Hachey et al. Look at abstracts of journal articles related to spectroscopy of active galaxies Try to identify nouns of four types instrument-name, name, spectral-feature, source-type, source-name Apply various techniques, using training data annotated by astro PhD students

Plan Computational Astrophysics N-body simulations of galaxy clustering Astro-Informatics Survey astronomy & the Virtual Observatory Discussion Astronomy and informatics

Two classes of research Computational Astrophysics Astronomers using computers to solve a specific problem in astrophysics Astro-Informatics Astronomers and computer scientists collaborating in the application of computational techniques to astronomy

c.f. distinction made by Jim Gray (Microsoft) Comp-X X-ologists using computers to solve a specific problem in X-ologyX X-Info X-ologists and computer scientists collaborating in the application of computational techniques to X-ologyX

Comp-X X & X-info X compared Comp-X Involves only X-ologistsX Should be funded as X-ologyX research X-Info Requires X-ologistsX and computer scientists How should this be funded? Can both sides be kept happy? Comp-X/X X/X-Info boundary is domain-specific Particle physics is almost all Comp-X Biology is mainly X-info X bioinformatics Astronomy is a mixture of both

Can X-info X work? Example of successful X-info: X PiCA group Pittsburgh Computational Astrostatistics Group Sustained collaboration: 1999 onwards Astronomy, CS and statistics expertise Focus on scalable data mining algorithms Astro requirements drive research in both statistics and CS Andrew Moore

Can X-info X work here? It is!...to some extent as this lecture series illustrates I ve described several astro-info projects How can we do X-info X better? Sustained interactions Understand areas of mutual interest Give-and and-take over individual projects..which require funding e.g. cross-school PhD studentships

Summary & Conclusions Astronomy relies on computation On both theoretical and observational sides In both Comp-X X and X-info X modes Astronomy is a good X for X-infoX Data: free, voluminous, no ethical issues Needs storing, indexing, describing, mining Challenge: how to make X-info X work well Huge rewards for {X} and informatics