SIO 210: Data analysis methods L. Talley, Fall Sampling and error 2. Basic statistical concepts 3. Time series analysis

Similar documents
SIO 210: Data analysis

SIO 210 CSP: Data analysis methods L. Talley, Fall Sampling and error 2. Basic statistical concepts 3. Time series analysis

EVALUATION OF THE GLOBAL OCEAN DATA ASSIMILATION SYSTEM AT NCEP: THE PACIFIC OCEAN

Lab 12: El Nino Southern Oscillation

The minimisation gives a set of linear equations for optimal weights w:

CHAPTER 2 DATA AND METHODS. Errors using inadequate data are much less than those using no data at all. Charles Babbage, circa 1850

NOAA In Situ Satellite Blended Analysis of Surface Salinity: Preliminary Results for

Inter-comparison of Historical Sea Surface Temperature Datasets

Inter-comparison of Historical Sea Surface Temperature Datasets. Yasunaka, Sayaka (CCSR, Univ. of Tokyo, Japan) Kimio Hanawa (Tohoku Univ.

SIO 210 Physical properties of seawater (Lectures 2 and 3) Fall, 2016 L. Talley

NOTES AND CORRESPONDENCE. A Quantitative Estimate of the Effect of Aliasing in Climatological Time Series

Investigate the influence of the Amazon rainfall on westerly wind anomalies and the 2002 Atlantic Nino using QuikScat, Altimeter and TRMM data

Ocean data assimilation for reanalysis

MERIDIONAL OVERTURNING CIRCULATION: SOME BASICS AND ITS MULTI-DECADAL VARIABILITY

Lesson IV. TOPEX/Poseidon Measuring Currents from Space

ENSO Outlook by JMA. Hiroyuki Sugimoto. El Niño Monitoring and Prediction Group Climate Prediction Division Japan Meteorological Agency

the 2 past three decades

Forced and internal variability of tropical cyclone track density in the western North Pacific

The role of the midlatitude ocean in sub-seasonal prediction

Diagnosing the Climatology and Interannual Variability of North American Summer Climate with the Regional Atmospheric Modeling System (RAMS)

JP1.7 A NEAR-ANNUAL COUPLED OCEAN-ATMOSPHERE MODE IN THE EQUATORIAL PACIFIC OCEAN

Predictability of North Atlantic Sea Surface Temperature and Ocean Heat Content

2013 ATLANTIC HURRICANE SEASON OUTLOOK. June RMS Cat Response

552 Final Exam Preparation: 3/12/14 9:39 AM Page 1 of 7

(chose the sign to ensure that it is evaporative)

OCN/ATM/ESS 587. Ocean circulation, dynamics and thermodynamics.

Module Contact: Dr Xiaoming Zhai, ENV Copyright of the University of East Anglia Version 2

SIO 210 Problem Set 2 October 17, 2011 Due Oct. 24, 2011

Preliminary study of multi-year ocean salinity trends with merged SMOS and Aquarius data.

CHAPTER 7 Ocean Circulation Pearson Education, Inc.

Atmospheric circulation analysis for seasonal forecasting

SIO 210 Final Exam December 10, :30 2:30 NTV 330 No books, no notes. Calculators can be used.

SUPPLEMENTARY INFORMATION

an accessible interface to marine environmental data Russell Moffitt

Proxy reconstructions of Pacific decadal variability. March 8, 2006

Separation of a Signal of Interest from a Seasonal Effect in Geophysical Data: I. El Niño/La Niña Phenomenon

Optimal Spectral Decomposition (OSD) for Ocean Data Analysis

SIO 210: Dynamics VI: Potential vorticity

2 Observing the Ocean Ships Navigation The Electronics Era 16

Results of oceanographic analyses conducted under JARPA and possible evidence of environmental changes.

ATMOSPHERIC MODELLING. GEOG/ENST 3331 Lecture 9 Ahrens: Chapter 13; A&B: Chapters 12 and 13

PROCESSING RECORD SCRIPPS INSTITUTION OF OCEANOGRAPHY ARCHIVES. Physical Description: 13 mss, ½ mss, 1 shoe box, 1 card box, oversize, slides

Optimal Spectral Decomposition (OSD) for GTSPP Data Analysis

An Introduction to Coupled Models of the Atmosphere Ocean System

ENVIRONMENTAL DATA ANALYSIS WILLIAM MENKE JOSHUA MENKE WITH MATLAB COPYRIGHT 2011 BY ELSEVIER, INC. ALL RIGHTS RESERVED.

Climate Change Impacts on the Marine Environment

Uncertainty in Ocean Surface Winds over the Nordic Seas

Impacts of modes of climate variability, monsoons, ENSO, annular modes

C

Ocean Currents Student Activity Book

Small scale mixing in coastal areas

First-Order Draft Chapter 3 IPCC WG1 Fourth Assessment Report

Reprint 675. Variations of Tropical Cyclone Activity in the South China Sea. Y.K. Leung, M.C. Wu & W.L. Chang

Ocean Data Assimilation for Seasonal Forecasting

JMA s Seasonal Prediction of South Asian Climate for Summer 2018

Studying the Ocean Using Live Data

IPCC AR5 WG1 - Climate Change 2013: The Physical Science Basis. Nandini Ramesh

Will it be a Good Ski Season? Correlation between El Niño and U.S. Weather

The Atmospheric Circulation

The Stratospheric Link Between the Sun and Climate

Variability of the Tropical Ocean Surface Temperatures at Decadal Multidecadal Timescales. Part I: The Atlantic Ocean

Seasonal Climate Outlook for South Asia (June to September) Issued in May 2014

Tropical Pacific Ocean model error covariances from Monte Carlo simulations

NOTES AND CORRESPONDENCE. El Niño Southern Oscillation and North Atlantic Oscillation Control of Climate in Puerto Rico

How typical are current conditions?

Global Ocean Monitoring: A Synthesis of Atmospheric and Oceanic Analysis

lecture 11 El Niño/Southern Oscillation (ENSO) Part II

Lecture 8: Natural Climate Variability

(Received 9 June 1997; in revised form 29 August 1997; accepted 29 August 1997)

column Shelby A. Jones Fall 2014 Lynne Talley SIO 210: Introduction to Physical Oceanography

On Sampling Errors in Empirical Orthogonal Functions

Introduction of products for Climate System Monitoring

SIO 210: Dynamics VI (Potential vorticity) L. Talley Fall, 2014 (Section 2: including some derivations) (this lecture was not given in 2015)

June 1989 T. Nitta and S. Yamada 375. Recent Warming of Tropical Sea Surface Temperature and Its. Relationship to the Northern Hemisphere Circulation

Climate impact on interannual variability of Weddell Sea Bottom Water

Lecture 4:the observed mean circulation. Atmosphere, Ocean, Climate Dynamics EESS 146B/246B

Q.1 The most abundant gas in the atmosphere among inert gases is (A) Helium (B) Argon (C) Neon (D) Krypton

Pacific HYCOM. E. Joseph Metzger, Harley E. Hurlburt, Alan J. Wallcraft, Luis Zamudio and Patrick J. Hogan

NOTES AND CORRESPONDENCE. On the Seasonality of the Hadley Cell

Winter Forecast for GPC Tokyo. Shotaro TANAKA Tokyo Climate Center (TCC) Japan Meteorological Agency (JMA)

3. Midlatitude Storm Tracks and the North Atlantic Oscillation

From El Nino to Atlantic Nino: pathways as seen in the QuikScat winds

Contents of this file

Characteristics of Storm Tracks in JMA s Seasonal Forecast Model

Cape Verde. General Climate. Recent Climate. UNDP Climate Change Country Profiles. Temperature. Precipitation

Global Ocean Tripole and Climate Variability

1.6 Correlation maps CHAPTER 1. DATA ANALYSIS 47

Fourier Analysis. 19th October 2015

ANNUAL CLIMATE REPORT 2016 SRI LANKA

PRMS WHITE PAPER 2014 NORTH ATLANTIC HURRICANE SEASON OUTLOOK. June RMS Event Response

The increasing intensity of the strongest tropical cyclones

MC-KPP: Efficient, flexible and accurate air-sea coupling

Snapshots of sea-level pressure are dominated in mid-latitude regions by synoptic-scale features known as cyclones (SLP minima) and anticyclones (SLP

Available online at ScienceDirect. Procedia Environmental Sciences 33 (2016 ) Rahmat Hidayat*

lecture 10 El Niño and the Southern Oscillation (ENSO) Part I sea surface height anomalies as measured by satellite altimetry

Retrospective El Niño Forecasts Using an Improved Intermediate Coupled Model

ATSR SST Observations of the Tropical Pacific Compared with TOPEX/Poseidon Sea Level Anomaly

An Improved Method for Analyzing Sparse and Irregularly Distributed SST Data on a Regular Grid: The Tropical Pacific Ocean

Thermohaline and wind-driven circulation

Q & A on Trade-off between intensity and frequency of global tropical cyclones

Transcription:

SIO 210: Data analysis methods L. Talley, Fall 2016 1. Sampling and error 2. Basic statistical concepts 3. Time series analysis 4. Mapping 5. Filtering 6. Space-time data 7. Water mass analysis Reading: DPO Chapter 6 Look only at the less mathematical parts (skip sections on pdfs, least squares, EOFs and OMP except to note that the material is there) 6.1, 6.2, 6.3.1, 6.4, 6.5, 6.6.2, 6.7.1, 6.7.2 (6.3.3, 6.6.1 of interest for dynamics students)

1. Sampling and error: definitions Sampling (DPO Section 6.1) Synoptic sampling Climatology Mean Anomaly (difference between synoptic measurement and a mean

1. Sampling: mean and anomaly Climatology (mean): mean field based on large data base, usually covering many years Climatologyo of sea (Reynolds October Mean) Observed Field (Oct. 2012) Anomaly: Oct. 2012 Minus Reynolds October climatology http://www.pmel.noaa.gov/tao/

DPO Section 6.2 (PRECISION) 1. Sampling and error Random error: fluctuations (in either direction) of measured values due to precision limitations of the measurement device. Random error is quantified by the variance or standard deviation. (ACCURACY) Systematic error (bias): offset, high or low, which cannot be determined through statistical methods used on the measurements themselves. An oceanographic example: Two or more technical groups measure the same parameters (e.g. temperature, nutrients or oxygen, etc). The mean values the groups obtain differ because of differences in methods, chemical standards, etc. Error can only be evaluated by comparison of the two sets of measurements with each other or with an absolute standard.

2. Basic statistical concepts: Mean, variance, standard deviation (DPO Section 6.3.1) Mean: x = 1 N N x i i=1 Anomaly: x = x mean + x anomaly ; therefore x anomaly = x-x mean Variance: N σ 2 = 1 (x N-1 i -x) 2 i=1 N = 1 N-1 [ (x i) 2 i=1 - N 1 N 2 ( x i) ] i=1 Standard deviation of the measurements: σ (characteristic of the phenomenon) Standard error: σ/ N (characteristic of the SAMPLING since it depends on N)

2. Basic statistical concepts: Standard deviation vs. standard error Standard deviation is a measure of variability in the field that is measured. Standard error is a measure of how well the field is sampled. Salinity at 500 m Standard deviation Talley SIO 210 (2016) Sampling Standard error DPO Fig.10/10/16 6.13

3. Time series analysis (DPO 6.3.1, 6.3.3) A time series is a data set collected as a function of time Examples: current meter records, sea level records, temperature at the end of the SIO pier Some common analysis methods: A. Display the data (simply plot)! (always useful) B. Mean, variance, standard deviation, etc. (example not shown) C. Covariance and correlation of different time series with each other D. Spectral (Fourier) analysis

3.A. Display time series (a) Time series of temperature at Fanning Island (Pacific Ocean) from the NCAR Community Ocean Model. DPO FIGURE 6.2

3.A. Display time series Examples of time series plots: (a) property/time, (b) time series of profiles, (c) current speed and direction, and (d) stick diagram for data of (c). Talley SIO 210 (2016) 10/10/16

3.A. Display time series: Hovmöller diagram (time on one axis, space on the other) Equatorial Pacific sea level height anomaly from satellite (High sea level anomaly - El Nino!) (Low propagating west to east)

3.C. Time series analysis: covariance (DPO section 6.3.3) Covariance and correlation: integral of the product of two time series, can be with a time lag. (Autocorrelation is the time series with itself with a time lag.) Integral time scale: time scale of the autocorrelation (definitions vary) crude definition might be at what time lag does the autocorrelation drop to zero (which is the decorrelation timescale) (First calculate the autocorrelation for a large number of time lags.) Degrees of freedom: total length of record divided by the integral time scale that gives how many realizations of the phenomenon you have. Good to have about 10.

3.C. Time series: covariance and correlation (DPO Section 6.3.3) Covariance: how two variables are related statistically. cov(x,y) = 1 (x i x )(y i y ) N 1 N i=1 Correlation: covariance divided by the product of the (sample) standard deviations of the two variables. ρ x,y = cov(x,y) σ x σ y Correlations have values from 0 to 1. How to tell if a correlation is significant?

3.C. Time series: autocorrelation and timescales (a) Time series of temperature at Fanning Island (Pacific Ocean) from the NCAR Community Ocean Model. (b) Autocorrelation normalized to a maximum value of 1 (biased estimate with averages divided by N). (c and d) Autocorrelation (unbiased estimate with averages divided by N n). Source: From Gille (2005). Decorrelation time scale: 1 st 0 crossing. Integral time scale is a little longer (based on integral of the autocorrelation) DPO FIGURE 6.2

3.C. Time series analysis: confidence intervals (DPO Section 6.3.3) Confidence intervals: based on degrees of freedom and assumptions about statistical distribution 95% confidence interval: 95% probability that a value is within the standard error of the mean (times a t-test quantity that you look up).

3.C. Time series analysis: confidence intervals (DPO Section 6.3.3) Example of time series with confidence intervals. Global ocean heat content (10 22 J) for the 0 to 700m layer, based on Levitus et al. (2005a; black curve), Ishii et al. (2006; full record gray curve and larger error bar), and Willis et al. (2004; darker gray after 1993 and shorter error bar). Shading and error bars denote the 90% confidence interval. Compare with Figure S15.15 seen on the textbookweb site from Domingues et al. (2008) which uses improved observations: NEXT SLIDE Talley SIO 210 (2016) 10/10/16

3.C. Time series analysis: confidence intervals (DPO Section 6.3.3) Figure S15.17 on the textbook Website from Domingues et al. (2008) which uses improved observations Bottom line: uncertainty estimates are only as good as the underlying data! Talley SIO 210 (2016) 10/10/16

3.C. Time series analysis: correlation example Example of correlation of different time series with each other North Atlantic Oscillation index (time series) Correlation with surface temperature and with surface pressure (done at each point, so each lat/lon gives a time series to correlate with NAO index) Talley SIO 210 (2016) Similar to DPO S15.2 and others S15 figures 10/10/16

3.D. Time series analysis: spectral analysis (DPO Section 6.5.3) Spectral (Fourier) analysis of a time series is used to determine its frequency distributions. Underlying concept: any time series can be decomposed into a continuous set of sinusoidal functions of varying frequency. Spectrum: amplitude of each of the frequency components. Very useful for detecting things like tides, inertial motions, that have specific forcing frequencies.

3.D. Time series analysis: spectrum (DPO Section 6.5.3) Example of time series, spectra, and spectral confidence intervals. (a) Velocity (cm/sec) stick plot, lowpassed at 100 hours, from 5 deep current meters at different depths on one mooring in the Deep Western Boundary Current in Samoan Passage (see Figure 10.16). The vertical direction is along the passage axis. (b) Spectra from the same current meters, offset by one decade. The 95% confidence intervals are shown at the bottom. Source: From Rudnick (1997). Talley SIO 210 (2016) 10/10/16

Spectra: 3.D. Time series analysis: spectrum (DPO Section 6.5.3) Limitations are imposed by 1. sampling interval. The highest frequency that can be resolved in the spectrum (called the Nyquist frequency ) is 1/2Δt where Δt is the sampling interval. Any energy at higher frequencies becomes aliased into lower frequencies, causing error in their amplitude estimates. 2. length of the total record. Generally should have about 10 realizations of a given frequency for the amplitude to be significant. Can prove difficult when trying to detect climate change for instance. This is related to degrees of freedom.

3.D. Time series analysis: Fundamental frequency and Nyquist frequency: sources of error in time series analysis (DPO Section 6.5.3) Sampling: N samples at time interval Δt. Length of time series is then T = NΔt Fundamental frequency The lowest frequency that is resolved is» f = 1/T = 1/NΔt Nyquist frequency The highest frequency that is resolved is» f = 1/2Δt

3.D. Time series: Nyquist frequency and aliasing (DPO Section 6.5.3) Aliasing: what happens to high frequencies that are above the Nyquist frequency cutoff? They are in the record, and are sampled, but not resolvable because their frequency content is not resolved They are aliased to a lower frequency.

4. Mapping - Least squares methods (DPO Section 6.3.4) How do we map data that are not regularly spaced (particularly in space)? Least squares methods are often used for mapping etc. A. Objective mapping B. Data assimilation (merging data and computer models) (no info given here) C. Inverse methods (finding optimal solution given sparse or incomplete data) (no info given here)

4.A. Mapping - Spatial sampling: Objective analysis (DPO Section 6.4) Many oceanographic data are collected in several spatial dimensions. Sample separations are very irregular. We want to interpolate these in an optimal way to a grid for plotting and comparison with other data sets. Objective analysis is a method for interpolating. It requires prior knowledge of the spatial correlation scales of the data to be mapped; since these would normally be calculated from the data themselves, they are often chosen in an ad hoc fashion.

4.A. Oceanographic sampling: objective mapping in the vertical (DPO Section 6.4.1) Discrete bottle samples: objectively mapped to uniform grid in vertical (10 m) and horizontal (10 km) WOCE Pacific atlas http://woceatlas.ucsd.edu/index.html (Talley, 2007)

4.A. Oceanographic sampling: Vertical sections (DPO Section 6.4.1) Vertical coordinates: 1. Depth (pressure) 2. Potential density or neutral density (or theta) Talley SIO 210 (2016) DPO Figs. 4.11 and 9.17 10/10/16

4.A. Oceanographic sampling: Mapping in the horizontal (DPO Section 6.4.2) Original station data: objectively mapped to a regular grid (NOAA/ NODC) Different types of surfaces that are commonly used: 1. Constant depth 2. Isopycnal 3. core layer here salinity maximum DPO Fig. 6.4 Talley SIO 210 (2016) 10/10/16

5. Filtering (DPO Section 6.5.4) Time series or spatial sampling includes phenomena at frequencies that may not be of interest (examples: want inertial mode but sampling includes seasonal, surface waves, etc.) Use various filtering methods (running means with windows or more complicated methods of reconstructing time series) to isolate frequency of interest Example: Time series of a climate index with 1-year and 5-year running means as filters. DPO Fig. 6.9

6. Space-time sampling: empirical orthogonal functions etc (DPO Section 6.6.1) Underlying physical processes might not be best characterized by sines and cosines, especially in the spatial domain. They may be better characterized by functions that look like the basin or circulation geometries. Empirical orthogonal function analysis: let the data decide what the basic (orthogonal) functions are that add to give the observed values. Basic EOF analysis: obtain spatial EOFs with a time series of amplitude for each EOF. (The time series itself could be Fourier-analyzed if desired.)

6. EOF example: Southern Annular Mode Modes of decadal (climate) variability NAM

7. Water mass analysis (DPO Section 6.7.2) Property-property relations (e.g., theta-s) Volumetric property-property Optimum multiparameter analysis (OMP) (Section 2 only)

7. Water mass analysis: potential temperaturesalinity diagrams (DPO Section 6.7.2) Potential temperaturesalinity diagram used in several lectures looking at Atlantic properties Talley SIO 210 (2016) DPO FIGURE 10/10/166.14

7. Water mass analysis: potential temperaturesalinity diagrams (DPO Section 6.7.2) Potential temperature (ºC) versus salinity for (a) full water column, and (b) water colder than 10ºC. (c) Potential temperature versus oxygen for full water column. (d) Station location map. Colors indicate latitude range. Contours are potential density referenced to 0 dbar. Data are from the World Ocean Circulation Experiment (1988 1997). This figure can also be found in the color insert. TALLEY Copyright 2011 Elsevier Inc. All rights reserved DPO FIGURE 9.18

7. Water mass analysis: volumetric potential temperature-salinity (DPO Section 6.7.2) DPO 4.17b

7. Optimum multiparameter analysis (OMP) (DPO Section 6.7.3) Use of water properties (T, S, oxygen, nutrients, other tracers): define source waters in terms of properties, then use a least squares analysis to assign every observation to a proportion of each source water Example of NADW vs. AABW: fractions of each (Johnson et al., 2008) DPO 14.15