Gaia data publication

Similar documents
Astrometry in Gaia DR1

The Gaia Mission. Coryn Bailer-Jones Max Planck Institute for Astronomy Heidelberg, Germany. ISYA 2016, Tehran

Gaia Data Release 1: Datamodel description

GREAT. Kick-off meeting

The overture to a new era in Galactic science: Gaia's first data release. Martin Altmann Centre for Astrophysics of the University of Heidelberg

Gaia Data Processing - Overview and Status

Gaia News:Counting down to launch A. Vallenari. INAF, Padova Astronomical Observatory on behalf of DPACE

Building the cosmic distance scale: from Hipparcos to Gaia

Gaia Status & Early Releases Plan

Milky Way star clusters

The Three Dimensional Universe, Meudon - October, 2004

Gaia s view of star clusters

ESTIMATING DISTANCES FROM PARALLAXES. III. DISTANCES OF TWO MILLION STARS IN THE Gaia DR1 CATALOGUE

Gaia. Stereoscopic Census of our Galaxy. one billion pixels for one billion stars

Gaia: Mapping the Milky Way

Tristan Cantat-Gaudin

Gaia DR2 astrometry. IAU 30 GA Division A: Fundamental Astronomy Vienna, 2018 August 27.

Exoplanetary transits as seen by Gaia

"The Gaia distances" Jan Rybizki (MPIA CU8) Gaia Data Workshop Heidelberg 06/19/18

Stellar distances and velocities. ASTR320 Wednesday January 24, 2018

From Gaia frame to ICRF-3 3?

Measuring the Properties of Stars (ch. 17) [Material in smaller font on this page will not be present on the exam]

ESASky, ESA s new open-science portal for ESA space astronomy missions

Catalog Information and Recommendations

Standard candles in the Gaia perspective

The Astrometry Satellite Gaia

1.0. σ M B0 B5 A0 A5 F0 F5 G0 G5 K0 K B0 B5 A0 A5 F0 F5 G0 G5 K0 K5

Asterseismology and Gaia

Data challenges of the Virtual Observatory in Time Domain Astronomy

Astr As ome tr tr ome y I M. Shao

GDR1 photometry. CU5/DPCI team

Gaia data in the CDS portal. Sébastien Derriere, Mark Allen, Thomas Boch & CDS team Observatoire de Strasbourg, France

Distance and extinction determination for stars in LAMOST and APOGEE survey

The cosmic distance scale

Selection of stars to calibrate Gaia

The Impact of Gaia on Our Knowledge of Stars and Their Planets

Chapter 8: The Family of Stars

Linking the ICRF and the future Gaia optical frame

Gaia data access scenarios summary

First Cycle Processing of Gaia data

The Gaia mission: status, problems, opportunities

Gaia Photometric Data Analysis Overview

Lecture 25: The Cosmic Distance Scale Sections 25-1, 26-4 and Box 26-1

D4.2. First release of on-line science-oriented tutorials

Parallaxes with Hubble Space Telescope

5. A particular star has an angle of parallax of 0.2 arcsecond. What is the distance to this star? A) 50 pc B) 2 pc C) 5 pc D) 0.

Photometric relationships between Gaia photometry and existing photometric systems

An application of the KNND method for detecting nearby open clusters based on Gaia-DR1 data

Gaia: a new vision of our Galaxy and our neighbours

Synergies between E-ELT and space instrumentation for extrasolar planet science

The Plato Input Catalog (PIC)

High-performance computing in Java: the data processing of Gaia. X. Luri & J. Torra ICCUB/IEEC

DETERMINATION OF STELLAR ROTATION WITH GAIA AND EFFECTS OF SPECTRAL MISMATCH. A. Gomboc 1,2, D. Katz 3

Optical TDE hunting with Gaia

Towards an accurate alignment of the VLBI frame with the future Gaia optical frame

Techniques for measuring astronomical distances generally come in two variates, absolute and relative.

CCD astrometry and UBV photometry of visual binaries

Large Synoptic Survey Telescope

ECLIPSING BINARIES: THE ROYAL ROAD. John Southworth (Keele University)

Local Volume, Milky Way, Stars, Planets, Solar System: L3 Requirements

GAIA: SOLAR SYSTEM ASTROMETRY IN DR2

(a) B-V 6 V. (b) B-V

GENIUS. Final Report for WP5

Structure & Evolution of Stars 1

Star clusters before and after Gaia Ulrike Heiter

Outline. ESA Gaia mission & science objectives. Astrometric & spectroscopic census of all stars in Galaxy to G=20 mag.

Relativity and Astrophysics Lecture 15 Terry Herter. RR Lyrae Variables Cepheids Variables Period-Luminosity Relation. A Stellar Properties 2

Massive OB stars as seen by Gaia

ABSTRACT. to assume that the stars in this sample share similar statistical properties.

Introduction The Role of Astronomy p. 3 Astronomical Objects of Research p. 4 The Scale of the Universe p. 7 Spherical Astronomy Spherical

Gaia, LSST and binaries

arxiv: v1 [astro-ph] 12 Nov 2008

489 THE METALLICITY DISTRIBUTION OF LATE-TYPE DWARFS AND THE HIPPARCOS HR DIAGRAM M. Haywood, J. Palasi, A. Gomez, L. Meillon DASGAL, URA 335 du CNRS

ELSA Mid-Term Review meeting Coordinator's report

Lecture 12: Distances to stars. Astronomy 111

Simulations of the Gaia final catalogue: expectation of the distance estimation

Thoughts on future space astrometry missions

Gaia, the universe in 3D: an overview of the mission. Gaia, the universe in 3D: an overview of the mission. X. Luri, ICCUB/IEEC

Basic Properties of the Stars

How to calibrate interferometric data

Gravitational microlensing. Exoplanets Microlensing and Transit methods

Radio-optical outliers a case study with ICRF2 and SDSS

A statistical test for validation of Gaia data

Tutorial: Exploring Gaia data with TOPCAT and STILTS

ASTR Look over Chapter 15. Good things to Know. Triangulation

JINA Observations, Now and in the Near Future

arxiv: v1 [astro-ph.ep] 4 Nov 2014

Gaia. Stereoscopic Census of our Galaxy. one billion pixels for one billion stars

Interferometric orbits of new Hipparcos binaries

Gaia Data Release 1. Documentation release 1.1. European Space Agency and Gaia Data Processing and Analysis Consortium

Quasometry, Its Use and Purpose

VISTA HEMISPHERE SURVEY DATA RELEASE 1

Access to massive catalogues in the Gaia archive: a new paradigm

468 Six dierent tests were used to detect variables, with dierent sensitivities to light-curve features. The mathematical expression for the limiting

CCD astrometry and instrumental V photometry of visual double stars,

Astrometric planet detectability with Gaia, a short AGISLab study

The Milky Way Galaxy (ch. 23)

5) Which stage lasts the longest? a) viii b) I c) iv d) iii e) vi

Observations of gravitational microlensing events with OSIRIS. A Proposal for a Cruise Science Observation

Science in the Virtual Observatory

Transcription:

Gaia data publication X. Luri, Univ. Barcelona, ICCUB-IEEC

The archive and the DPAC CU9

Coordination units Data processing centers

DPAC coordination units CU1: System Architecture CU2: Data Simulations CU3: Core Processing CU4: Object Processing (multiple objects) CU5: Photometric Processing CU6: Spectroscopic Processing CU7: Variability Processing CU8: Astrophysical Parameters CU9: Catalogue Access

Archive (CU9) AO Two documents sent to ESA Formal response: 72 pages + support letters Austria, France (x2), Germany, Italy, Netherlands, Portugal, Spain, Sweden, UK Appendix: 114 pages, bios + detailed WP description Used for the CU9 SPD

CU9/Archive goals A thoroughly validated and well documented set of Gaia data, ready for scientific exploitation. A functional, single point access to all Gaia science data, including user support. A defined API to the science data repository to allow the development of high throughput access applications and visualization tools. A set of advanced applications to allow for defined manipulation and visualization of the data. A set of tools and content for outreach, tailored for different targets (media, academic and general public) to maximize the social impact of the mission.

Product tree

Archive overview Core (ESAC)

Schedule tied to DPAC data releases

Data rights Bound by the Science Management Plan (ESA/SPC(2006)45) which states that there will not be proprietary data rights for Gaia. Further reinforced by DPAC s own data access policy (FM- 044) which is also binding for CU9. Within CU9 access to actual Gaia data will be restricted as much as possible before actual data releases. Instead simulated Gaia data generated in coordination with CU2 (see Section 10.5) will be extensively used for development, testing and validation of software. In the cases where real data needs to be used the members of CU9 will be bound by the publication policy (FM-039). To this respect, quoting the AO: The SMP identifies the GST as the entity defining procedures for any early access to the data.

Data Release scenario

First release: 14 September 2016 Positions (α, δ) and G magnitudes for 1.1 billion stars Five-parameter astrometric solution - positions, parallaxes, and proper motions - for about 2 milllion stars (TGAS) Photometric data of RR Lyrae and Cepheid variable stars QSO special-solution catalogue

Second release: Q4 2017 Five-parameter astrometric solutions of objects with single-star behaviour Integrated BP/RP photometry + estimated stellar parameters from these data Mean radial velocities for objects showing no radialvelocity variation and for which an adequate synthetic template could be selected Photometric data of RR Lyrae and Cepheid variable stars SSO

Third release: 2018 (TBC) Orbital solutions, together with the system radial velocity and five-parameter astrometric solutions, for binaries having periods between 2 months and 75% of the observing time Object classification and astrophysical parameters, together with BP/RP spectra and/or RVS spectra they are based on for spectroscopically and (spectro-)photometrically well-behaved objects. Mean radial velocities for those stars not showing variability and with available atmospheric-parameter estimates.

Fourth release: 2019 (TBC) Variable-star classifications together with the epoch photometry used for the stars. Solar-system results with preliminary orbital solutions and individual epoch observations. Non-single star catalogue

Fifth release: 2022 (TBC) Full astrometric, photometric, and radial-velocity catalogues. All available variable-star and non-single-star solutions. Source classifications (probabilities) plus multiple astrophysical parameters (derived from BP/RP, RVS, and astrometry) for stars, unresolved binaries, galaxies, and quasars. Some parameters may not be available for faint(er) stars. An exo-planet list. All epoch and transit data for all sources. All ground-based observations made for dataprocessing purposes.

+ Additional releases in case of mission extension

The archive

The Gaia data Where? Main Gaia archive: ESAC Partner data centres: AIP, ARI, ASI, BCN, CDS Affiliated and other data centres, enthusiasts: tens What? tgas_source (~2M 5 parameter astrometry): all gala_source (~1.1G positions): most Variability (599 cepheids, 2595 RR Lyrae), QSOs (2191 Xmatch (2MASS, PPMXL, SDSS9, UCAC4, URAT1, WISE) External catalogues (for Xmatch): ESAC What else @ESAC?: TAP+ (data base, user space, share), cross-match

The Gaia archive

The simple query interface

The ADQL search tab

+ Additional interface through TAP (used by tools like TopCat)

Documentation

Gaia Archive. Includes help and tutorials https://archives.esac.esa.int/gaia Gaia DR1 papers http://www.cosmos.esa.int/web/gaia/dr1#a&a Online documentation (361 pages) http://gaia.esac.esa.int/documentation/gdr1/index.html Data model documentation https://gaia.esac.esa.int/documentation/gdr1/datamodel/ ADQL: GAVO short course, UK ROE cookbook http://docs.g-vo.org/adql-gaia/html/index.html https://gaia.ac.uk/science/gaia-data-release-1/adql-cookbook Gaia Helpdesk. https://support.cosmos.esa.int/gaia/

Partner data centres CDS, ARI, AIP, ASI

Cone search: GAVO-ARI. http://gaia.ari.uni-heidelberg.de/ SQL: GAVO-AIP. https://gaia.aip.de/ CDS: Vizier, Aladdin, cross-match http://cdsxmatch.u-strasbg.fr http://tapvizier.u-strasbg.fr/adql/ Simbad: forthcoming future. http://simbad.u-strasbg.fr/simbad/

+ Further distribution from affiliate data centres (i.e. NAOJ)

Future developments

Bringing code to the data: expanded archive functionality

Gaia Added Value Interface (Gaia-AVI) (docker-based system) GENIUS data mining framework (Spark-Hadoop based system)

A possible cloudy future

Working with astrometric data - warnings and caveats -

Error-free data No random errors No biases No correlations Complete sample No censorships Direct measurements No transformations No assumptions Scientist s dream

Bias: Errors 1: biases your measurement is systematically too large or too small For DR1 parallaxes: Probable global zero-point offset present; -0.04 mas found during validation Colour dependent and spatially correlated systematic errors at the level of 0.2 mas Over large spatial scales, the parallax zero-point variations reach an amplitude of 0.3 mas Over a few smaller areas (2 degree radius), much larger parallax biases may occur of up to 1 mas There may be specific problems in a few individual cases

Global zero point from QSO parallaxes

Global zero point from Cepheids

Regional effects from QSOs (ecliptic coordinates)

BAM1 BAM2 WFS1 WFS2 SM1 SM2 AF1 AF2 AF3 AF4 AF5 AF6 AF7 AF8 AF9 BP RP RVS1 RVS2 RVS3 Split FoV early late Gaia DR1 Workshop - ESAC 2016 Nov 3 L. Lindegren: Astrometry in Gaia DR1 4 45

Regional effects from split FOV solutions (equatorial coordinates)

How to take this into account You can introduce a global zero-point offset to use the parallaxes (suggested -0.04 mas) You cannot correct the regional features: if we could, we would already have corrected them. We have indications that these zero points may be present, but no more. For most of the sky assume an additional systematic error of 0.3 mas; your derived standard errors for anything cannot go below this value ϖ ± σ ϖ (random) ± 0.3 mas (syst.) For a few smaller regions be aware that the systematics might reach 1 mas This is possibly the sole aspect in which Gaia DR1 is not better than Hipparcos (apart from the incompleteness for the brightest stars)

More specifically: treat separately random error and bias, but if you must combine them, a worst case formula can be as follows For individual parallaxes: to be on the safe side add 0.3 mas to the standard uncertainty Total sqrt( 2 Std+0.3 2 ) When averaging parallaxes for groups of stars: the random error will decrease as sqrt(n) but the systematic error (0.3 mas) will not decrease final sqrt( 2 averagestd+0.3 2 ) where averagestd decrease is the formal standard deviation of the average, computed in the usual way from the sigmas of the individual values in the average (giving essentially the sqrt(n) reduction). Don t try to get a zonal correction from previous figures, it s too risky

For DR1 proper motions and positions: In this case Gaia data is the best available, by far. We do not have means to do a check as precise as the one done for parallaxes, but there are no indications of any significant bias For positions remember that for comparison purposes you will likely have to convert them to another epoch. You should propagate the errors accordingly.

Comparison with Tycho-2 shows that catalogue s systematics (not Gaia s)

Errors 2: random errors Random error: your measurements are randomly distributed around the true value Each measurement in the catalogue comes with a formal error Random errors in Gaia are quasi-normal. The formal error can be assimilated to the variance of a normal distribution around the true value. Published formal errors for Gaia DR1 may be slightly overestimated

Warning: comparison with Hipparcos shows deviation from normality beyond ~2 To take into account for outlier analysis

Warning: when comparing with other sources of trigonometric parallaxes take into account the properties of the error distributions Observations TGAS vs Hipparcos Simulations The slope at small parallaxes is not a bias in either TGAS or HIP, simply due to the different size of the errors in the two catalogues!

Eclipsing binaries parallaxes vs TGAS arxiv:1609.05390v3 Simulation The overall slope is due to the different error distributions in parallax (lognormal for photometric, normal for trigonometric)

Errors 3: correlations Correlation: the measurements of several quantities are not independent from each other Whenever you take linear combinations of such quantities, the correlations have to be taken into account in the error calculus ( and even more so for non-linear functions ) The errors in the five astrometric parameters provided are not independent The ten correlations between these parameters are provided in the Gaia DR1 archives (correlation matrix)

Examples of problematic use: Simple epoch propagation (!) pos&pm Calculation of proper directions pos&pm&parallax Proper motion in a given direction on the sky (other than north-south or east-west) proper-motion components Proper motion components in galactic or ecliptic coordinates proper-motion components More complex, non-linear example: Calculating the transversal velocities of a set of stars The resulting dispersion of velocities is influenced by the errors in parallax and in proper motion; thus 3-dimensional case. Its determination can not be done using the parallax and proper motion errors separately; the correlations have to be taken into account But this time it s non-linear! The error distribution will no longer be Gaussian. The A matrix of the previous page will become the Jacobian matrix of the local derivatives of the transversal velocity wrt parallax and pm components

Beware: large and unevenly distributed correlations in DR1; example: PmRA-vs.-Parallax correlation

Chapter 4: Transformations Transformations: when the quantity you want to study is not the quantity you observe Usually you want distances, not parallaxes Usually you want spatial velocities, not proper motions

Warning: when using a transformed quantity the error distribution also is transformed This is especially crucial for the calculation of distances from parallaxes And even more so for the calculation of luminosities from parallaxes A symmetrical, well behaved error in parallax is transformed into an asymmetrical error in distance

Measured parallax/true parallax Error distribution comparison: parallax versus distance plotted for sigma(parallax)=0.21*true parallax Transformation: distance = 1 / parallax Measured distance/true distance

Measured parallax/true parallax Error distribution comparison: parallax versus distance Transformation: distance = 1 / parallax mode median mean rms always infinite Measured distance/true distance

Measured parallax/true parallax Error distribution comparison: parallax versus distance Transformation: distance = 1 / parallax mode median mean rms always infinite Measured distance/true distance

Sample simulation with a parallax error of 2mas True distance vs. distance from parallax Overestimation of distances by 14pc=14% on average, and of luminosities by over 40% on average.

How to take this into account Avoid using transformations as much as possible If unavoidable: Do fits in the plane of parallaxes (e.g. PL relations using ABL method*) where errors are well behaved Do any averaging in parallaxes and then do the transformation (e.g. distance to an open cluster) Always estimate the remaining effect (analytically or with simulations) *Astrometry-Based Luminosity (ABL) method This quantity is: - related to luminosity (sqrt of inverse luminosity) - a linear function of parallax - thus nicely behaved - thus can be averaged safely

Also beware of additional assumptions For instance about the absorption when calculating absolute magnitudes from parallaxes

Chapter 5: Sample censorships Completeness/representativeness: we have the complete population of objects or at least a subsample which is representative for a given purpose DR1 is a very complex dataset, its completeness or representativeness can not be guaranteed for any specific purpose

Significant completeness variations as a function of the sky position

Complex selection of astrometry (e.g. Nobs)

How to take this into account Very difficult, will depend on your specific purpose Analyze if the problem exists, and try to determine if the known censorships are correlated with the parameter you are analyzing (see validation paper) At least do some simulations to evaluate the possible effects

IMPORTANT: do not make things worse by adding your own additional censorships This is specially important for parallaxes Avoid removing negative parallaxes; this removes information and biases the sample for distant stars Avoid selecting subsamples on parallax relative error. This also removes information and biases the sample for distant stars Use instead fitting methods able to use all available data (e.g. Bayesian methods) and always work on the observable space (e.g. on parallaxes, not on distances or luminosities)

Example: Original (complete) dataset (errors in parallax of 2mas) Average diff. of parallaxes = 0.002 mas

Example: removing negative parallaxes Favours large parallaxes Average diff. of parallaxes = 0.65 mas

Example: removing sigmapar/par > 50% Favours errors making parallax larger Observed parallaxes systematically too large Average diff. of parallaxes = 2.2 mas

Example: truncation by observed parallax Favours objects at large distances (small true parallax) Consequence: Near to the horizon you will e.g. get an overestimate of the star density; and an underestimate of the mean luminosity of the selected stars.

Thank you

During this presentation - about 1 million stars were measured by Gaia, - roughly 10 million astrometric measurements were taken, - about 300,000 spectra were made of 100,000 stars