Engineering considerations for large astrophysics projects

Size: px
Start display at page:

Download "Engineering considerations for large astrophysics projects"

Transcription

1 Engineering considerations for large astrophysics projects David W. Hogg Center for Cosmology and Particle Physics Department of Physics New York University Max-Planck-Institut für Astronomie Heidelberg, Germany 2014 January 9

2 punchlines Calibration programs are wasteful and reduce the accuracy of your end-of-mission results. (you will need to adjust your observing strategy) Homogeneity and uniformity of survey samples are impossible, unnecessary, and harmful goals. (you will need to implement some probability theory) Proper uncertainty propagation is not easy. (I got nothing) The challenge is to make precise measurements and keep discovery space open. (you will need to understand, quantitatively, your goals)

3 my teachers (incomplete list) Gerry Neugebauer (Caltech, emeritus) Sam Roweis (Toronto & NYU, deceased) Dave Schlegel (LBL) & Scott Burles (Cutler) Mike Blanton (NYU) Dustin Lang (CMU) & Jo Bovy (IAS) & Dan Foreman-Mackey (NYU)

4 survey-centric context Gaia SKA and pathfinders Euclid LSST SDSS-IV... and many more (I am going to get mean at the end.)

5 my day job Astrometry.net and TheTractor emcee and kplr precision measurement, probabilistic inference data-driven models

6 homogeneity and uniformity are impossible weather target selection hardware evolution efficiency considerations

7 probabilistic target selection SDSS-III SDSS-III BOSS quasar target selection in SDSS bandpasses, z 3 quasars look like A-type stars stars outnumber quasars enormously don t have good models of either Bovy et al. arxiv: this target selection cannot be uniform heterogeneous data quality means heterogeneous target selection star density varies on the sky suck it up!

8 homogeneity and uniformity are unnecessary correct the data compute inverse selection volume or probabilities 1/V max (ish) re-weight the data using these inverse volumes very wrong! forward modeling write down uncensored p0 (data parameters) multiply by (one minus) censoring rate η(data) renormalize to get expected p(data parameters) this is a likelihood function (but: visualizing a forward model)

9 estimators Cramèr Rao bound example: Gaia astrometry likelihood principle(s) it is our duty to analyze our very limited data with optimal methods the output of any data analysis must be a likelihood function WMAP, Planck

10 likelihood principle I said function. p(data parameters)

11 living the likelihood dream don t make a catalog of objects that s some kind of (probably inefficient) estimator even with error bars it can t transmit the full information produce a likelihood function in catalog space Lang et al. Brewer et al. arxiv:

12

13

14

15 homogeneity and uniformity are unnecessary? special case of two-point functions (and higher orders) currently an unsolved problem (but papers from Wandelt s group)

16 homogeneity and uniformity are harmful can t be uniform in everything the uniformity you choose only helps one of your customers! uniform samples end up requiring a lot of time on the least useful objects reduces the heterogeneity that is essential to calibration

17 self-calibration final imaging calibration of SDSS made no use at all of the calibration program data Padmanabhan et al. arxiv:astro-ph/

18

19

20 calibration programs are wasteful there are more photons in the science data therefore, the science data contain more information about calibration (exceptions abound) you must take your data with proper heterogeneity! Kepler tiling patterns Holmes et al. arxiv:

21 A 4 B Sky Position β (deg) C 4 D Sky Position β (deg) Sky Position α (deg) Sky Position α (deg)

22 1.0 Focal Plane Position y (deg) Focal Plane Position y (deg) Focal Plane Position x (deg) (a) (c) Focal Plane Position x (deg) (b) (d) Residuals (%) Focal Plane Position y (deg) Focal Plane Position y (deg) Focal Plane Position x (deg) (a) (c) Focal Plane Position x (deg) (b) (d) Residuals (%) (a) (b) (a) (b) 1.0 Focal Plane Position y (deg) Focal Plane Position y (deg) Focal Plane Position x (deg) (c) Focal Plane Position x (deg) (d) Residuals (%) Focal Plane Position y (deg) Focal Plane Position y (deg) Focal Plane Position x (deg) (c) Focal Plane Position x (deg) (d) Residuals (%)

23 Self-calibration of imaging A good survey: every star appears in many images in different images, the star is in different places every image contains many stars Holmes et al. arxiv: Kepler and Spitzer exoplanet photometry is pessimal for self-calibration but for a very good reason!

24 target selection is classification SDSS-III SDSS-III BOSS is taking spectra of quasars, not stars stars outnumber (relevant) quasars by factors of hundreds observations are noisy and theoretical models are incomplete want to find only the quasars... or do we?

25 classification algorithms Support Vector Machine, Random Forest, Artificial Neural Net all bad! value of a causal model training and test samples don t match need to classify new data taken under different conditions make use of our technical knowledge about the data. Bovy et al. arxiv:

26 1-epoch model 30-epoch

27

28

29

30 aside: discovery as classification found an exoplanet? That s a model selection move. Bayes doesn t tell you how to make decisions. utility arises Make decisions that maximize expected (scientific?) return. Astrometry.net has an explicit utility model Automatic calibration of an image successful? Our customer model is that they are offended by false positives. Lang et al. arxiv:

31 utility considerations might be worth taking a source unlikely to be a quasar, as long as it is likely to be interesting need to be able to make these trade-offs quantitatively requires a specification of utility needs to be measured in dollars (or equivalent) long-term future discounted free cash flow the game of proposal writing we aren t honest in our proposals about what we want SDSS was over-designed by any measure that was valuable!

32 over-design SDSS was seriously over-designed to measure the large-scale structure (no-one thinks that was a bad idea) could have done all the large-scale structure in less than one year of observing we might have to be more honest going forward if we want to use resources efficiently, we need to face a trade-off between efficiency and discovery At the present, everything is heuristics. I say we make this trade-off explicitly, not implicitly.

33 utopia every part of your data analysis pipeline returns a likelihood function information propagation through the pipeline always by likelihood function implications are severe you can simulate data under different experimental designs likelihood is p(data parameters) you have a specified utility function converts information in your answer into dollars every decision can now be an optimization detectors, optical path, spectral elements filters, exposure times, cadences targets

34 example: bandpasses LSST plans to do imaging in ugrizy I am going to smash that r filter! why not do ugwizy? easy example because zero-cost change doesn t require full utility specification bet it is much better for low-s/n objects

35

36 hardware vs software trades P1640 Oppenheimer et al. arxiv: Fergus et al. in prep glitter cam Fergus et al. MIT-CSAIL-TR

37

38

39

40

41

42 open-source surveys Hipparcos example SDSS calibration example enormous benefits accrue from making the data re-reducable from scratch

43 throwing down the gauntlet Gaia uncertainty propagation (qualitative) Euclid observing strategy for imaging LSST bandpass, cadence, and exposure-time settings SKA pathfinder image products eboss two-point function estimators APOGEE & HERMES signal-to-noise requirements (My hourly rates are a bargain.) (These surveys are all awesome!)

44 punchlines Calibration programs are wasteful and reduce the accuracy of your end-of-mission results. (you will need to adjust your observing strategy) Homogeneity and uniformity of survey samples are impossible, unnecessary, and harmful goals. (you will need to implement some probability theory) Proper uncertainty propagation is not easy. (I got nothing) The challenge is to make precise measurements and keep discovery space open. (you will need to understand, quantitatively, your goals)

Designing Astronomical Projects for Real-Time Discovery, Operational Efficiency, and Statistical Re-Use of Legacy Data

Designing Astronomical Projects for Real-Time Discovery, Operational Efficiency, and Statistical Re-Use of Legacy Data Designing Astronomical Projects for Real-Time Discovery, Operational Efficiency, and Statistical Re-Use of Legacy Data Thank you Megan Bedell (Flatiron) Josh Bloom (Berkeley) Doug Finkbeiner (Harvard)

More information

Data-driven models of stars

Data-driven models of stars Data-driven models of stars David W. Hogg Center for Cosmology and Particle Physics, New York University Center for Data Science, New York University Max-Planck-Insitut für Astronomie, Heidelberg 2015

More information

Probabilistic modeling and Inference in Astronomy

Probabilistic modeling and Inference in Astronomy Probabilistic modeling and Inference in Astronomy Dan Foreman-Mackey Sagan Fellow, University of Washington github.com/dfm // @exoplaneteer // dfm.io Dan Foreman-Mackey Sagan Fellow, University of Washington

More information

Making precise and accurate measurements with data-driven models

Making precise and accurate measurements with data-driven models Making precise and accurate measurements with data-driven models David W. Hogg Center for Cosmology and Particle Physics, Dept. Physics, NYU Center for Data Science, NYU Max-Planck-Insitut für Astronomie,

More information

(Slides for Tue start here.)

(Slides for Tue start here.) (Slides for Tue start here.) Science with Large Samples 3:30-5:00, Tue Feb 20 Chairs: Knut Olsen & Melissa Graham Please sit roughly by science interest. Form small groups of 6-8. Assign a scribe. Multiple/blended

More information

Combining probabilities

Combining probabilities prepared by: approved by: reference: issue: 2 revision: 0 date: 2011-07-19 status: Issued Abstract Coryn A.L. Bailer-Jones, Kester Smith Max Planck Institute for Astronomy, Heidelberg Email: calj@mpia.de

More information

Constraining Dark Energy with BOSS. Nicolas Busca - APC Rencontres de Moriond 19/10/2010

Constraining Dark Energy with BOSS. Nicolas Busca - APC Rencontres de Moriond 19/10/2010 Constraining Dark Energy with BOSS Nicolas Busca - APC Rencontres de Moriond 19/10/2010 Outline BOSS: What? How? Current Status of BOSS Constraints on Cosmology Conclusions What: Baryon Acoustic Oscilations

More information

The Gaia Mission. Coryn Bailer-Jones Max Planck Institute for Astronomy Heidelberg, Germany. ISYA 2016, Tehran

The Gaia Mission. Coryn Bailer-Jones Max Planck Institute for Astronomy Heidelberg, Germany. ISYA 2016, Tehran The Gaia Mission Coryn Bailer-Jones Max Planck Institute for Astronomy Heidelberg, Germany ISYA 2016, Tehran What Gaia should ultimately achieve high accuracy positions, parallaxes, proper motions e.g.

More information

JINA Observations, Now and in the Near Future

JINA Observations, Now and in the Near Future JINA Observations, Now and in the Near Future Timothy C. Beers Department of Physics & Astronomy Michigan State University & JINA: Joint Institute for Nuclear Astrophysics Examples SDSS-I, II, and III

More information

the open-source sky survey David W. Hogg (NYU)

the open-source sky survey David W. Hogg (NYU) the open-source sky survey David W. Hogg (NYU) http://astrometry.net/ non-text searching need to search things that aren t text, with queries that aren t text even image search in Google requires accurate

More information

Basic BAO methodology Pressure waves that propagate in the pre-recombination universe imprint a characteristic scale on

Basic BAO methodology Pressure waves that propagate in the pre-recombination universe imprint a characteristic scale on Precision Cosmology With Large Scale Structure, Ohio State University ICTP Cosmology Summer School 2015 Lecture 3: Observational Prospects I have cut this lecture back to be mostly about BAO because I

More information

Cosmology on the Beach: Experiment to Cosmology

Cosmology on the Beach: Experiment to Cosmology Image sky Select targets Design plug-plates Plug fibers Observe! Extract spectra Subtract sky spec. Cosmology on the Beach: Experiment to Cosmology Fit redshift Make 3-D map Test physics! David Schlegel!1

More information

Astronomical image reduction using the Tractor

Astronomical image reduction using the Tractor the Tractor DECaLS Fin Astronomical image reduction using the Tractor Dustin Lang McWilliams Postdoc Fellow Carnegie Mellon University visiting University of Waterloo UW / 2015-03-31 1 Astronomical image

More information

Andy Casey Astrophysics; Statistics

Andy Casey Astrophysics; Statistics Science in the era of Gaia data big Astrophysics; Statistics andycasey astrowizicist astrowizici.st Science in the era of Gaia data big - The Gaia mission All about Gaia. What makes data big? Science in

More information

Present and future redshift survey David Schlegel, Berkeley Lab

Present and future redshift survey David Schlegel, Berkeley Lab Present and future redshift survey David Schlegel, Berkeley Lab David Schlegel, COSMO-17 @Paris, 30 Aug 2017 1 Redshift surveys = one of ~few probes of inflationary epoch Inflation-era parameters: non-gaussianity,

More information

CS 188: Artificial Intelligence Spring Today

CS 188: Artificial Intelligence Spring Today CS 188: Artificial Intelligence Spring 2006 Lecture 9: Naïve Bayes 2/14/2006 Dan Klein UC Berkeley Many slides from either Stuart Russell or Andrew Moore Bayes rule Today Expectations and utilities Naïve

More information

The Dark Energy Survey Public Data Release 1

The Dark Energy Survey Public Data Release 1 The Dark Energy Survey Public Data Release 1 Matias Carrasco Kind (NCSA/UIUC) and the DR1 Release Team https://des.ncsa.illinois.edu/ Near-Field Cosmology with DES DR1 and Beyond Workshop, June 27-29th,

More information

From the Big Bang to Big Data. Ofer Lahav (UCL)

From the Big Bang to Big Data. Ofer Lahav (UCL) From the Big Bang to Big Data Ofer Lahav (UCL) 1 Outline What is Big Data? What does it mean to computer scientists vs physicists? The Alan Turing Institute Machine learning examples from Astronomy The

More information

Dark Energy. Cluster counts, weak lensing & Supernovae Ia all in one survey. Survey (DES)

Dark Energy. Cluster counts, weak lensing & Supernovae Ia all in one survey. Survey (DES) Dark Energy Cluster counts, weak lensing & Supernovae Ia all in one survey Survey (DES) What is it? The DES Collaboration will build and use a wide field optical imager (DECam) to perform a wide area,

More information

Machine Learning Applications in Astronomy

Machine Learning Applications in Astronomy Machine Learning Applications in Astronomy Umaa Rebbapragada, Ph.D. Machine Learning and Instrument Autonomy Group Big Data Task Force November 1, 2017 Research described in this presentation was carried

More information

SkyMapper and the Southern Sky Survey

SkyMapper and the Southern Sky Survey SkyMapper and the Southern Sky Survey Stefan Keller Mt. Stromlo Observatory Brian Schmidt, Mike Bessell and Patrick Tisserand SkyMapper 1.35m telescope with a 5.7 sq. degree field of view located at Siding

More information

ROSAT Roentgen Satellite. Chandra X-ray Observatory

ROSAT Roentgen Satellite. Chandra X-ray Observatory ROSAT Roentgen Satellite Joint facility: US, Germany, UK Operated 1990 1999 All-sky survey + pointed observations Chandra X-ray Observatory US Mission Operating 1999 present Pointed observations How do

More information

CS 188: Artificial Intelligence Fall 2008

CS 188: Artificial Intelligence Fall 2008 CS 188: Artificial Intelligence Fall 2008 Lecture 23: Perceptrons 11/20/2008 Dan Klein UC Berkeley 1 General Naïve Bayes A general naive Bayes model: C E 1 E 2 E n We only specify how each feature depends

More information

General Naïve Bayes. CS 188: Artificial Intelligence Fall Example: Overfitting. Example: OCR. Example: Spam Filtering. Example: Spam Filtering

General Naïve Bayes. CS 188: Artificial Intelligence Fall Example: Overfitting. Example: OCR. Example: Spam Filtering. Example: Spam Filtering CS 188: Artificial Intelligence Fall 2008 General Naïve Bayes A general naive Bayes model: C Lecture 23: Perceptrons 11/20/2008 E 1 E 2 E n Dan Klein UC Berkeley We only specify how each feature depends

More information

TMT and Space-Based Survey Missions

TMT and Space-Based Survey Missions TMT and Space-Based Survey Missions Daniel Stern Jet Propulsion Laboratory/ California Institute of Technology 2014 California Institute of Technology TMT Science Forum 2014 July 17 Outline Summary of

More information

Science Results Enabled by SDSS Astrometric Observations

Science Results Enabled by SDSS Astrometric Observations Science Results Enabled by SDSS Astrometric Observations Željko Ivezić 1, Mario Jurić 2, Nick Bond 2, Jeff Munn 3, Robert Lupton 2, et al. 1 University of Washington 2 Princeton University 3 USNO Flagstaff

More information

Lecture : Probabilistic Machine Learning

Lecture : Probabilistic Machine Learning Lecture : Probabilistic Machine Learning Riashat Islam Reasoning and Learning Lab McGill University September 11, 2018 ML : Many Methods with Many Links Modelling Views of Machine Learning Machine Learning

More information

BigBOSS Data Reduction Software

BigBOSS Data Reduction Software BigBOSS Data Reduction Software The University of Utah Department of Physics & Astronomy The Premise BigBOSS data reduction software is as important as BigBOSS data collection hardware to the scientific

More information

Ways to make neural networks generalize better

Ways to make neural networks generalize better Ways to make neural networks generalize better Seminar in Deep Learning University of Tartu 04 / 10 / 2014 Pihel Saatmann Topics Overview of ways to improve generalization Limiting the size of the weights

More information

The Galaxy Zoo Project

The Galaxy Zoo Project Astronomy 201: Cosmology Fall 2009 Prof. Bechtold NAME: The Galaxy Zoo Project 200 points Due: Nov. 23, 2010, in class Professional astronomers often have to search through enormous quantities of data

More information

Introduction to AI Learning Bayesian networks. Vibhav Gogate

Introduction to AI Learning Bayesian networks. Vibhav Gogate Introduction to AI Learning Bayesian networks Vibhav Gogate Inductive Learning in a nutshell Given: Data Examples of a function (X, F(X)) Predict function F(X) for new examples X Discrete F(X): Classification

More information

Lyman-α Cosmology with BOSS Julián Bautista University of Utah. Rencontres du Vietnam Cosmology 2015

Lyman-α Cosmology with BOSS Julián Bautista University of Utah. Rencontres du Vietnam Cosmology 2015 Lyman-α Cosmology with BOSS Julián Bautista University of Utah Rencontres du Vietnam Cosmology 2015 Lyman-α Forest of a Quasar (by Andrew Pontzen) F ( obs )= Observed flux Unabsorbed flux ( obs) 2 Lyman-α

More information

LSST Science. Željko Ivezić, LSST Project Scientist University of Washington

LSST Science. Željko Ivezić, LSST Project Scientist University of Washington LSST Science Željko Ivezić, LSST Project Scientist University of Washington LSST@Europe, Cambridge, UK, Sep 9-12, 2013 OUTLINE Brief overview of LSST science drivers LSST science-driven design Examples

More information

Exoplanet Microlensing Surveys with WFIRST and Euclid. David Bennett University of Notre Dame

Exoplanet Microlensing Surveys with WFIRST and Euclid. David Bennett University of Notre Dame Exoplanet Microlensing Surveys with WFIRST and Euclid David Bennett University of Notre Dame Why Space-based Microlensing? Space-based microlensing is critical for our understanding of exoplanet demographics

More information

Optical/NIR Spectroscopy A3130. John Wilson Univ of Virginia

Optical/NIR Spectroscopy A3130. John Wilson Univ of Virginia Optical/NIR Spectroscopy A3130 John Wilson Univ of Virginia Topics: Photometry is low resolution spectroscopy Uses of spectroscopy in astronomy Data cubes and dimensionality challenge Spectrograph design

More information

Surprise Detection in Science Data Streams Kirk Borne Dept of Computational & Data Sciences George Mason University

Surprise Detection in Science Data Streams Kirk Borne Dept of Computational & Data Sciences George Mason University Surprise Detection in Science Data Streams Kirk Borne Dept of Computational & Data Sciences George Mason University kborne@gmu.edu, http://classweb.gmu.edu/kborne/ Outline Astroinformatics Example Application:

More information

An end-to-end simulation framework for the Large Synoptic Survey Telescope Andrew Connolly University of Washington

An end-to-end simulation framework for the Large Synoptic Survey Telescope Andrew Connolly University of Washington An end-to-end simulation framework for the Large Synoptic Survey Telescope Andrew Connolly University of Washington LSST in a nutshell The LSST will be a large, wide-field, ground-based optical/near-ir

More information

Data Analysis Methods

Data Analysis Methods Data Analysis Methods Successes, Opportunities, and Challenges Chad M. Schafer Department of Statistics & Data Science Carnegie Mellon University cschafer@cmu.edu The Astro/Data Science Community LSST

More information

Inference of cluster distance and geometry from astrometry

Inference of cluster distance and geometry from astrometry Inference of cluster distance and geometry from astrometry Coryn A.L. Bailer-Jones Max Planck Institute for Astronomy, Heidelberg Email: calj@mpia.de 2017-12-01 Abstract Here I outline a method to determine

More information

(Present and) Future Surveys for Metal-Poor Stars

(Present and) Future Surveys for Metal-Poor Stars (Present and) Future Surveys for Metal-Poor Stars Timothy C. Beers Department of Physics & Astronomy Michigan State University & JINA: Joint Institute for Nuclear Astrophysics SDSS 1 Why the Fascination

More information

Exoplanetary transits as seen by Gaia

Exoplanetary transits as seen by Gaia Highlights of Spanish Astrophysics VII, Proceedings of the X Scientific Meeting of the Spanish Astronomical Society held on July 9-13, 2012, in Valencia, Spain. J. C. Guirado, L.M. Lara, V. Quilis, and

More information

LSST, Euclid, and WFIRST

LSST, Euclid, and WFIRST LSST, Euclid, and WFIRST Steven M. Kahn Kavli Institute for Particle Astrophysics and Cosmology SLAC National Accelerator Laboratory Stanford University SMK Perspective I believe I bring three potentially

More information

A8824: Statistics Notes David Weinberg, Astronomy 8824: Statistics Notes 6 Estimating Errors From Data

A8824: Statistics Notes David Weinberg, Astronomy 8824: Statistics Notes 6 Estimating Errors From Data Where does the error bar go? Astronomy 8824: Statistics otes 6 Estimating Errors From Data Suppose you measure the average depression of flux in a quasar caused by absorption from the Lyman-alpha forest.

More information

Lecture - 24 Radial Basis Function Networks: Cover s Theorem

Lecture - 24 Radial Basis Function Networks: Cover s Theorem Neural Network and Applications Prof. S. Sengupta Department of Electronic and Electrical Communication Engineering Indian Institute of Technology, Kharagpur Lecture - 24 Radial Basis Function Networks:

More information

and the University of Wisconsin Milwaukee

and the University of Wisconsin Milwaukee and the University of Wisconsin Milwaukee David Kaplan (University of Wisconsin, Milwaukee) kaplan@uwm.edu Leonard E. Parker Center for Gravitation, Cosmology & Astrophysics The Milwaukee Perspective +

More information

Some ML and AI challenges in current and future optical and near infra imaging datasets

Some ML and AI challenges in current and future optical and near infra imaging datasets Some ML and AI challenges in current and future optical and near infra imaging datasets Richard McMahon (Institute of Astronomy, University of Cambridge) and Cameron Lemon, Estelle Pons, Fernanda Ostrovski,

More information

From quasars to dark energy Adventures with the clustering of luminous red galaxies

From quasars to dark energy Adventures with the clustering of luminous red galaxies From quasars to dark energy Adventures with the clustering of luminous red galaxies Nikhil Padmanabhan 1 1 Lawrence Berkeley Labs 04-15-2008 / OSU CCAPP seminar N. Padmanabhan (LBL) Cosmology with LRGs

More information

Hogg & Fergus / CDI-Type I: A unified probabilistic model of astronomical imaging 1

Hogg & Fergus / CDI-Type I: A unified probabilistic model of astronomical imaging 1 Hogg & Fergus / CDI-Type I: A unified probabilistic model of astronomical imaging 1 1 Introduction The hierarchical structure of astrophysics: All astrophysics knowledge comes, ultimately, from photons

More information

Signal Modeling, Statistical Inference and Data Mining in Astrophysics

Signal Modeling, Statistical Inference and Data Mining in Astrophysics ASTRONOMY 6523 Spring 2013 Signal Modeling, Statistical Inference and Data Mining in Astrophysics Course Approach The philosophy of the course reflects that of the instructor, who takes a dualistic view

More information

VISTA HEMISPHERE SURVEY DATA RELEASE 1

VISTA HEMISPHERE SURVEY DATA RELEASE 1 Release date (will be set by ESO) VISTA HEMISPHERE SURVEY DATA RELEASE 1 PROPOSAL ESO No.: 179.A-2010 PRINCIPAL INVESTIGATOR: Richard McMahon Authors: R. McMahon, M. Banerji, N. Lodieu for the VHS Collaboration

More information

Big Data Inference. Combining Hierarchical Bayes and Machine Learning to Improve Photometric Redshifts. Josh Speagle 1,

Big Data Inference. Combining Hierarchical Bayes and Machine Learning to Improve Photometric Redshifts. Josh Speagle 1, ICHASC, Apr. 25 2017 Big Data Inference Combining Hierarchical Bayes and Machine Learning to Improve Photometric Redshifts Josh Speagle 1, jspeagle@cfa.harvard.edu In collaboration with: Boris Leistedt

More information

Understanding How ConvNets See

Understanding How ConvNets See Understanding How ConvNets See Slides from Andrej Karpathy Springerberg et al, Striving for Simplicity: The All Convolutional Net (ICLR 2015 workshops) CSC321: Intro to Machine Learning and Neural Networks,

More information

Bayesian Networks Inference with Probabilistic Graphical Models

Bayesian Networks Inference with Probabilistic Graphical Models 4190.408 2016-Spring Bayesian Networks Inference with Probabilistic Graphical Models Byoung-Tak Zhang intelligence Lab Seoul National University 4190.408 Artificial (2016-Spring) 1 Machine Learning? Learning

More information

Machine Learning. Hal Daumé III. Computer Science University of Maryland CS 421: Introduction to Artificial Intelligence 8 May 2012

Machine Learning. Hal Daumé III. Computer Science University of Maryland CS 421: Introduction to Artificial Intelligence 8 May 2012 Machine Learning Hal Daumé III Computer Science University of Maryland me@hal3.name CS 421 Introduction to Artificial Intelligence 8 May 2012 g 1 Many slides courtesy of Dan Klein, Stuart Russell, or Andrew

More information

Inference of Galaxy Population Statistics Using Photometric Redshift Probability Distribution Functions

Inference of Galaxy Population Statistics Using Photometric Redshift Probability Distribution Functions Inference of Galaxy Population Statistics Using Photometric Redshift Probability Distribution Functions Alex Malz Center for Cosmology and Particle Physics, New York University 7 June 2016 Galaxy population

More information

GDR1 photometry. CU5/DPCI team

GDR1 photometry. CU5/DPCI team GDR1 photometry CU5/DPCI team What is, or isn't included GDR1 only provides G mean flux and error Derived mean magnitude, all entries Zero point used is in Vega system Zero point for AB system also available

More information

Dark Matter on Small Scales: Merging and Cosmogony. David W. Hogg New York University CCPP

Dark Matter on Small Scales: Merging and Cosmogony. David W. Hogg New York University CCPP Dark Matter on Small Scales: Merging and Cosmogony David W. Hogg New York University CCPP summary galaxy merger rates suggest growth of ~1 percent per Gyr galaxy evolution is over can we rule out CDM now

More information

Search for & Characterizing Small Planets with NASA s Kepler Mission

Search for & Characterizing Small Planets with NASA s Kepler Mission Search for & Characterizing Small Planets with NASA s Kepler Mission Eric Ford University of Florida SAMSI Astrostatistics Workshop September 21, 2012 Image credit: NASA/ESA/StSci Golden Age for Planet

More information

The Large Synoptic Survey Telescope

The Large Synoptic Survey Telescope The Large Synoptic Survey Telescope Philip A. Pinto Steward Observatory University of Arizona for the LSST Collaboration 17 May, 2006 NRAO, Socorro Large Synoptic Survey Telescope The need for a facility

More information

Miscellany : Long Run Behavior of Bayesian Methods; Bayesian Experimental Design (Lecture 4)

Miscellany : Long Run Behavior of Bayesian Methods; Bayesian Experimental Design (Lecture 4) Miscellany : Long Run Behavior of Bayesian Methods; Bayesian Experimental Design (Lecture 4) Tom Loredo Dept. of Astronomy, Cornell University http://www.astro.cornell.edu/staff/loredo/bayes/ Bayesian

More information

Automated analysis: SDSS, BOSS, GIRAFFE

Automated analysis: SDSS, BOSS, GIRAFFE Automated analysis: SDSS, BOSS, GIRAFFE Tests with MILES spectra (R~2000) from the INT (Sanchez Blazquez et al. 2006) The same code (FERRE) Fitting data calibrated in flux and continuumnormalized Software

More information

The expected Gaia revolution in asteroid science: Photometry and Spectroscopy

The expected Gaia revolution in asteroid science: Photometry and Spectroscopy A. Cellino (INAF- Torino Observatory) P. Tanga, D. Hestroffer, K. Muinonen, A. Dell Oro, L. Galluccio The expected Gaia revolution in asteroid science: Photometry and Spectroscopy Although in situ exploration

More information

Wide-Field Radio Astronomy and the Dynamic Universe

Wide-Field Radio Astronomy and the Dynamic Universe CSIRO / Swinburne ; NASA Wide-Field Radio Astronomy and the Dynamic Universe Bryan Gaensler / @SciBry! Dunlap Institute! University of Toronto! with Cleo Loi, Kitty Lo,! Martin Bell, Keith Bannister,!

More information

1 A photometric probe for Pan-STARRS

1 A photometric probe for Pan-STARRS The Pan-STARRS Imaging Sky Probe B.R. Granett, K.C. Chambers, E.A. Magnier, K. Fletcher and the Pan-STARRS Project Team University of Hawai i Institute for Astronomy Abstract Photometric performance is

More information

RLW paper titles:

RLW paper titles: RLW paper titles: http://www.wordle.net Astronomical Surveys and Data Archives Richard L. White Space Telescope Science Institute HiPACC Summer School, July 2012 Overview Surveys & catalogs: Fundamental

More information

Astronomical "color"

Astronomical color Astronomical "color" What color is the star Betelgeuse? It's the bright star at upper left in this picture of Orion taken by a student at the RIT Observatory. Orange? Red? Yellow? These are all reasonable

More information

Classification & Information Theory Lecture #8

Classification & Information Theory Lecture #8 Classification & Information Theory Lecture #8 Introduction to Natural Language Processing CMPSCI 585, Fall 2007 University of Massachusetts Amherst Andrew McCallum Today s Main Points Automatically categorizing

More information

Hierarchical Probabilis.c Inference of Cosmic Shear

Hierarchical Probabilis.c Inference of Cosmic Shear Hierarchical Probabilis.c Inference of Cosmic Shear Sta$s$cal Challenges in Modern Astronomy VI June 9, 2016 Michael D. Schneider with Will Dawson, Josh Meyers Collaborators: D. Bard, D. Hogg, D. Lang,

More information

arxiv: v1 [astro-ph.im] 19 Oct 2016

arxiv: v1 [astro-ph.im] 19 Oct 2016 Do fast stellar centroiding methods saturate the Cramér-Rao lower bound? Mohammadjavad Vakili 1, David W. Hogg 1,2,3 arxiv:1610.05873v1 [astro-ph.im] 19 Oct 2016 mjvakili@nyu.edu ABSTRACT One of the most

More information

Milky Way star clusters

Milky Way star clusters Using Γα ια for studying Milky Way star clusters Eugene Vasiliev Institute of Astronomy, Cambridge MODEST-, 26 June Overview of Gaia mission Scanning the entire sky every couple of weeks Astrometry for

More information

MULTIPLE EXPOSURES IN LARGE SURVEYS

MULTIPLE EXPOSURES IN LARGE SURVEYS MULTIPLE EXPOSURES IN LARGE SURVEYS / Johns Hopkins University Big Data? Noisy Skewed Artifacts Big Data? Noisy Skewed Artifacts Serious Issues Significant fraction of catalogs is junk GALEX 50% PS1 3PI

More information

Astr As ome tr tr ome y I M. Shao

Astr As ome tr tr ome y I M. Shao Astrometry I M. Shao Outline Relative astrometry vs Global Astrometry What s the science objective? What s possible, what are fundamental limits? Instrument Description Error/noise sources Photon noise

More information

Introduction to SDSS -instruments, survey strategy, etc

Introduction to SDSS -instruments, survey strategy, etc Introduction to SDSS -instruments, survey strategy, etc (materials from http://www.sdss.org/) Shan Huang 17 February 2010 Survey type Status Imaging and Spectroscopy Basic Facts SDSS-II completed, SDSS-III

More information

CSE 473: Artificial Intelligence Autumn Topics

CSE 473: Artificial Intelligence Autumn Topics CSE 473: Artificial Intelligence Autumn 2014 Bayesian Networks Learning II Dan Weld Slides adapted from Jack Breese, Dan Klein, Daphne Koller, Stuart Russell, Andrew Moore & Luke Zettlemoyer 1 473 Topics

More information

Rick Ebert & Joseph Mazzarella For the NED Team. Big Data Task Force NASA, Ames Research Center 2016 September 28-30

Rick Ebert & Joseph Mazzarella For the NED Team. Big Data Task Force NASA, Ames Research Center 2016 September 28-30 NED Mission: Provide a comprehensive, reliable and easy-to-use synthesis of multi-wavelength data from NASA missions, published catalogs, and the refereed literature, to enhance and enable astrophysical

More information

CS 188: Artificial Intelligence. Machine Learning

CS 188: Artificial Intelligence. Machine Learning CS 188: Artificial Intelligence Review of Machine Learning (ML) DISCLAIMER: It is insufficient to simply study these slides, they are merely meant as a quick refresher of the high-level ideas covered.

More information

Source Detection in the Image Domain

Source Detection in the Image Domain Source Detection in the Image Domain Astro 6525, Fall 2015 Shami Chatterjee Oct 2015 Source Identification Identify a source in the presence of noise. Localize it as best as possible given instrumental

More information

The Kepler Exoplanet Survey: Instrumentation, Performance and Results

The Kepler Exoplanet Survey: Instrumentation, Performance and Results The Kepler Exoplanet Survey: Instrumentation, Performance and Results Thomas N. Gautier, Kepler Project Scientist Jet Propulsion Laboratory California Institute of Technology 3 July 2012 SAO STScI 2012

More information

Classifying Galaxy Morphology using Machine Learning

Classifying Galaxy Morphology using Machine Learning Julian Kates-Harbeck, Introduction: Classifying Galaxy Morphology using Machine Learning The goal of this project is to classify galaxy morphologies. Generally, galaxy morphologies fall into one of two

More information

WFIRST is not just for Cold Planets: Transiting Planets with the WFIRST Microlensing Survey

WFIRST is not just for Cold Planets: Transiting Planets with the WFIRST Microlensing Survey is not just for Cold Planets: Transiting Planets with the Microlensing Survey Matthew Penny, Ben Montet, Jennifer Yee Microlensing Science Investiagion Team Ohio State University penny@astronomy.ohio-state.edu

More information

Revealing the Large Scale Distribution of Star Formation in the Milky Way with WISE

Revealing the Large Scale Distribution of Star Formation in the Milky Way with WISE Revealing the Large Scale Distribution of Star Formation in the Milky Way with WISE Xavier Koenig Yale University WISE @ 5 Conference Feb 11 2015 Collaborators: David Leisawitz Debbie Padgett Luisa Rebull

More information

Celeste: Scalable variational inference for a generative model of astronomical images

Celeste: Scalable variational inference for a generative model of astronomical images Celeste: Scalable variational inference for a generative model of astronomical images Jeffrey Regier 1, Brenton Partridge 2, Jon McAuliffe 1, Ryan Adams 2, Matt Hoffman 3, Dustin Lang 4, David Schlegel

More information

Photometric Products. Robert Lupton, Princeton University LSST Pipeline/Calibration Scientist PST/SAC PST/SAC,

Photometric Products. Robert Lupton, Princeton University LSST Pipeline/Calibration Scientist PST/SAC PST/SAC, Photometric Products Robert Lupton, Princeton University LSST Pipeline/Calibration Scientist PST/SAC 2018-02-27 PST/SAC, 2018-02-27 LSST2017 Tucson, AZ August 14-18, 2017 1 Outline Photometry Point Sources

More information

SDSS-IV MaStar: a Large, Comprehensive, and High Quality Empirical Stellar Library

SDSS-IV MaStar: a Large, Comprehensive, and High Quality Empirical Stellar Library 3rd International Workshop on Spectral Stellar Libraries ASI Conference Series, 2017, Vol. 14, pp 99 103 Editors: P. Coelho, L. Martins & E. Griffin SDSS-IV MaStar: a Large, Comprehensive, and High Quality

More information

Natural Language Processing. Classification. Features. Some Definitions. Classification. Feature Vectors. Classification I. Dan Klein UC Berkeley

Natural Language Processing. Classification. Features. Some Definitions. Classification. Feature Vectors. Classification I. Dan Klein UC Berkeley Natural Language Processing Classification Classification I Dan Klein UC Berkeley Classification Automatically make a decision about inputs Example: document category Example: image of digit digit Example:

More information

Hierarchical Bayesian Modeling of Planet Populations

Hierarchical Bayesian Modeling of Planet Populations Hierarchical Bayesian Modeling of Planet Populations You ve found planets in your data (or not)...... now what?! Angie Wolfgang NSF Postdoctoral Fellow, Penn State Why Astrostats? This week: a sample of

More information

CSC321 Lecture 7 Neural language models

CSC321 Lecture 7 Neural language models CSC321 Lecture 7 Neural language models Roger Grosse and Nitish Srivastava February 1, 2015 Roger Grosse and Nitish Srivastava CSC321 Lecture 7 Neural language models February 1, 2015 1 / 19 Overview We

More information

Data Management Plan Extended Baryon Oscillation Spectroscopic Survey

Data Management Plan Extended Baryon Oscillation Spectroscopic Survey Data Management Plan Extended Baryon Oscillation Spectroscopic Survey Experiment description: eboss is the cosmological component of the fourth generation of the Sloan Digital Sky Survey (SDSS-IV) located

More information

Supernovae with Euclid

Supernovae with Euclid Supernovae with Euclid Isobel Hook University of Oxford and INAF (Obs. Roma) Thanks to R. Nichol, M. Della Valle, F. Mannucci, A. Goobar, P. Astier, B. Leibundgut, A. Ealet Euclid Conference 17 18 Nov

More information

Data Release 5. Sky coverage of imaging data in the DR5

Data Release 5. Sky coverage of imaging data in the DR5 Data Release 5 The Sloan Digital Sky Survey has released its fifth Data Release (DR5). The spatial coverage of DR5 is about 20% larger than that of DR4. The photometric data in DR5 are based on five band

More information

Information Field Theory. Torsten Enßlin MPI for Astrophysics Ludwig Maximilian University Munich

Information Field Theory. Torsten Enßlin MPI for Astrophysics Ludwig Maximilian University Munich Information Field Theory Torsten Enßlin MPI for Astrophysics Ludwig Maximilian University Munich Information Field Theory signal field: degrees of freedom data set: finite additional information needed

More information

Énergie noire Formation des structures. N. Regnault C. Yèche

Énergie noire Formation des structures. N. Regnault C. Yèche Énergie noire Formation des structures N. Regnault C. Yèche Outline Overview of DE probes (and recent highlights) Hubble Diagram of supernovae Baryon accoustic oscillations Lensing Matter clustering (JLA)

More information

arxiv:astro-ph/ v1 17 Oct 2002

arxiv:astro-ph/ v1 17 Oct 2002 The K correction David W. Hogg 1,2, Ivan K. Baldry 3, Michael R. Blanton 1, and Daniel J. Eisenstein 4 ABSTRACT arxiv:astro-ph/0210394 v1 17 Oct 2002 The K correction corrects for the fact that sources

More information

Automated Classification of HETDEX Spectra. Ted von Hippel (U Texas, Siena, ERAU)

Automated Classification of HETDEX Spectra. Ted von Hippel (U Texas, Siena, ERAU) Automated Classification of HETDEX Spectra Ted von Hippel (U Texas, Siena, ERAU) Penn State HETDEX Meeting May 19-20, 2011 Outline HETDEX with stable instrumentation and lots of data ideally suited to

More information

Requirements for the Star Tracker Parallel Science Programme

Requirements for the Star Tracker Parallel Science Programme Requirements for the Star Tracker Parallel Science Programme Rømer System Definition Phase 2000/2001 Document no.: MONS/IFA/PL/RS/0003(1) Date: 22.04.2001 Prepared by: Hans Kjeldsen and Tim Bedding Checked

More information

Two New Cool Variable Stars in the Field of NGC 659

Two New Cool Variable Stars in the Field of NGC 659 92 Two New Cool Variable Stars in the Field of NGC 659 Steven P. Souza Department of Astronomy, Williams College, Williamstown, MA, 01267; ssouza@williams.edu Received December 11, 2012; revised December

More information

Weak Lensing: Status and Prospects

Weak Lensing: Status and Prospects Weak Lensing: Status and Prospects Image: David Kirkby & the LSST DESC WL working group Image: lsst.org Danielle Leonard Carnegie Mellon University Figure: DES Collaboration 2017 for LSST DESC June 25,

More information

Primer on statistics:

Primer on statistics: Primer on statistics: MLE, Confidence Intervals, and Hypothesis Testing ryan.reece@gmail.com http://rreece.github.io/ Insight Data Science - AI Fellows Workshop Feb 16, 018 Outline 1. Maximum likelihood

More information

Potential Synergies Between MSE and the ELTs A Purely TMT-centric perspective But generally applicable to ALL ELTs

Potential Synergies Between MSE and the ELTs A Purely TMT-centric perspective But generally applicable to ALL ELTs Potential Synergies Between MSE and the ELTs A Purely TMT-centric perspective But generally applicable to ALL ELTs Warren Skidmore, TMT Instrument System Scientist 2 nd May, 2018 IPAC Science Talk 1 TMT

More information

CPSC 340: Machine Learning and Data Mining. Regularization Fall 2017

CPSC 340: Machine Learning and Data Mining. Regularization Fall 2017 CPSC 340: Machine Learning and Data Mining Regularization Fall 2017 Assignment 2 Admin 2 late days to hand in tonight, answers posted tomorrow morning. Extra office hours Thursday at 4pm (ICICS 246). Midterm

More information