arxiv: v3 [physics.ao-ph] 27 Jan 2011

Similar documents
What have we learned about our climate by using networks and nonlinear analysis tools?

Interannual Variability of the South Atlantic High and rainfall in Southeastern South America during summer months

Global Climate network evolves with North Atlantic Oscillation phases: Coupling to Southern Pacific Ocean

Hilbert analysis unveils inter-decadal changes in large-scale patterns of surface air temperature variability

The Formation of Precipitation Anomaly Patterns during the Developing and Decaying Phases of ENSO

Global Atmospheric Dynamics Investigated by Using Hilbert Frequency Analysis

Semiblind Source Separation of Climate Data Detects El Niño as the Component with the Highest Interannual Variability

Interacting Networks in Climate. Marcelo Barreiro Department of Atmospheric Sciences School of Sciences, Universidad de la República, Uruguay

Forced and internal variability of tropical cyclone track density in the western North Pacific

Nonlinear atmospheric teleconnections

The construction of complex networks from linear and nonlinear measures Climate Networks

1. Introduction. 3. Climatology of Genesis Potential Index. Figure 1: Genesis potential index climatology annual

June 1993 T. Nitta and J. Yoshimura 367. Trends and Interannual and Interdecadal Variations of. Global Land Surface Air Temperature

NOTES AND CORRESPONDENCE. El Niño Southern Oscillation and North Atlantic Oscillation Control of Climate in Puerto Rico

ENSO Cycle: Recent Evolution, Current Status and Predictions. Update prepared by Climate Prediction Center / NCEP 23 April 2012

The North Atlantic Oscillation: Climatic Significance and Environmental Impact

El Niño / Southern Oscillation

Delayed Response of the Extratropical Northern Atmosphere to ENSO: A Revisit *

Dynamics of the Extratropical Response to Tropical Heating

JP1.7 A NEAR-ANNUAL COUPLED OCEAN-ATMOSPHERE MODE IN THE EQUATORIAL PACIFIC OCEAN

ENSO: Recent Evolution, Current Status and Predictions. Update prepared by: Climate Prediction Center / NCEP 9 November 2015

Quasi-Biennial Oscillation Modes Appearing in the Tropical Sea Water Temperature and 700mb Zonal Wind* By Ryuichi Kawamura

East-west SST contrast over the tropical oceans and the post El Niño western North Pacific summer monsoon

ENSO Cycle: Recent Evolution, Current Status and Predictions. Update prepared by Climate Prediction Center / NCEP 5 August 2013

The Influence of Intraseasonal Variations on Medium- to Extended-Range Weather Forecasts over South America

The Coupled Model Predictability of the Western North Pacific Summer Monsoon with Different Leading Times

ENSO Cycle: Recent Evolution, Current Status and Predictions. Update prepared by Climate Prediction Center / NCEP 11 November 2013

ENSO: Recent Evolution, Current Status and Predictions. Update prepared by: Climate Prediction Center / NCEP 30 October 2017

ENSO Cycle: Recent Evolution, Current Status and Predictions. Update prepared by Climate Prediction Center / NCEP 15 July 2013

Climate Outlook for December 2015 May 2016

The Role of Indian Ocean Sea Surface Temperature in Forcing East African Rainfall Anomalies during December January 1997/98

ENSO Cycle: Recent Evolution, Current Status and Predictions. Update prepared by Climate Prediction Center / NCEP 25 February 2013

Inter ENSO variability and its influence over the South American monsoon system

Discovering Dynamic Dipoles in Climate Data

lecture 11 El Niño/Southern Oscillation (ENSO) Part II

SHORT COMMUNICATION EXPLORING THE RELATIONSHIP BETWEEN THE NORTH ATLANTIC OSCILLATION AND RAINFALL PATTERNS IN BARBADOS

Interannual Teleconnection between Ural-Siberian Blocking and the East Asian Winter Monsoon

ENSO Cycle: Recent Evolution, Current Status and Predictions. Update prepared by Climate Prediction Center / NCEP 24 September 2012

An observational study of the impact of the North Pacific SST on the atmosphere

P2.11 DOES THE ANTARCTIC OSCILLATION MODULATE TROPICAL CYCLONE ACTIVITY IN THE NORTHWESTERN PACIFIC

Benguela Niño/Niña events and their connection with southern Africa rainfall have been documented before. They involve a weakening of the trade winds

2013 ATLANTIC HURRICANE SEASON OUTLOOK. June RMS Cat Response

June 1989 T. Nitta and S. Yamada 375. Recent Warming of Tropical Sea Surface Temperature and Its. Relationship to the Northern Hemisphere Circulation

Lecture 8: Natural Climate Variability

Evaluating a Genesis Potential Index with Community Climate System Model Version 3 (CCSM3) By: Kieran Bhatia

Global Climate Patterns and Their Impacts on North American Weather

THE INFLUENCE OF CLIMATE TELECONNECTIONS ON WINTER TEMPERATURES IN WESTERN NEW YORK INTRODUCTION

P3.6 THE INFLUENCE OF PNA AND NAO PATTERNS ON TEMPERATURE ANOMALIES IN THE MIDWEST DURING FOUR RECENT El NINO EVENTS: A STATISTICAL STUDY

Forcing of Tropical SST Anomalies by Wintertime AO-like Variability

Influence of autocorrelation on the topology of the climate network Oded C. Guez 1*, Avi Gozolchiani 2 and Shlomo Havlin 1. 1

Possible Roles of Atlantic Circulations on the Weakening Indian Monsoon Rainfall ENSO Relationship

Chapter outline. Reference 12/13/2016

Exploring Climate Patterns Embedded in Global Climate Change Datasets

El Niño Seasonal Weather Impacts from the OLR Event Perspective

the 2 past three decades

Change in Occurrence Frequency of Stratospheric Sudden Warmings. with ENSO-like SST Forcing as Simulated WACCM

Katherine E. Lukens and E. Hugo Berbery. Acknowledgements: Kevin I. Hodges 1 and Matthew Hawcroft 2. University of Reading, Reading, Berkshire, UK

Assessment of the Impact of El Niño-Southern Oscillation (ENSO) Events on Rainfall Amount in South-Western Nigeria

Ocean-Atmosphere Interactions and El Niño Lisa Goddard

ENSO, AO, and climate in Japan. 15 November 2016 Yoshinori Oikawa, Tokyo Climate Center, Japan Meteorological Agency

ENSO effects on mean temperature in Turkey

Dynamics of delayed-coupled chaotic logistic maps: Influence of network topology, connectivity and delay times

The Antarctic Dipole and its Predictability

Separation of a Signal of Interest from a Seasonal Effect in Geophysical Data: I. El Niño/La Niña Phenomenon

TROPICAL-EXTRATROPICAL INTERACTIONS

Reprint 675. Variations of Tropical Cyclone Activity in the South China Sea. Y.K. Leung, M.C. Wu & W.L. Chang

Contents of this file

Impacts of modes of climate variability, monsoons, ENSO, annular modes

ENSO Cycle: Recent Evolution, Current Status and Predictions. Update prepared by Climate Prediction Center / NCEP July 26, 2004

CHINESE JOURNAL OF GEOPHYSICS. Analysis of the characteristic time scale during ENSO. LIU Lin 1,2, YU Wei2Dong 2

CHAPTER 1: INTRODUCTION

Introduction of climate monitoring and analysis products for one-month forecast

SEASONAL ENVIRONMENTAL CONDITIONS RELATED TO HURRICANE ACTIVITY IN THE NORTHEAST PACIFIC BASIN

Climate Variability. Andy Hoell - Earth and Environmental Systems II 13 April 2011

Impact of Zonal Movement of Indian Ocean High Pressure on Winter Precipitation over South East Australia

Inter-comparison of Historical Sea Surface Temperature Datasets

NOTES AND CORRESPONDENCE. On the Seasonality of the Hadley Cell

PRMS WHITE PAPER 2014 NORTH ATLANTIC HURRICANE SEASON OUTLOOK. June RMS Event Response

Analysis of Fall Transition Season (Sept-Early Dec) Why has the weather been so violent?

The nonlinear association between ENSO and the Euro-Atlantic winter sea level pressure

Analysis Links Pacific Decadal Variability to Drought and Streamflow in United States

Effect of anomalous warming in the central Pacific on the Australian monsoon

Large-Scale Circulation Features Typical of Wintertime Extensive and Persistent Low Temperature Events in China

Supplement of Vegetation greenness and land carbon-flux anomalies associated with climate variations: a focus on the year 2015

Figure 1. Time series of Western Sahel precipitation index and Accumulated Cyclone Energy (ACE).

3. Carbon Dioxide (CO 2 )

Teleconnections and Climate predictability

Monitoring and Prediction of Climate Extremes

P6.16 A 16-YEAR CLIMATOLOGY OF GLOBAL RAINFALL FROM SSM/I HIGHLIGHTING MORNING VERSUS EVENING DIFFERENCES 2. RESULTS

Introduction of products for Climate System Monitoring

Conference on Teleconnections in the Atmosphere and Oceans November Fall-to-winter changes in the El Nino teleconnection

ENSO and April SAT in MSA. This link is critical for our regression analysis where ENSO and

Definition of Antarctic Oscillation Index

Oceanic origin of the interannual and interdecadal variability of the summertime western Pacific subtropical high

CLIMATE SIMULATION AND ASSESSMENT OF PREDICTABILITY OF RAINFALL IN THE SOUTHEASTERN SOUTH AMERICA REGION USING THE CPTEC/COLA ATMOSPHERIC MODEL

Tropical drivers of the Antarctic atmosphere

The feature of atmospheric circulation in the extremely warm winter 2006/2007

Climate Dynamics (PCC 587): Hydrologic Cycle and Global Warming

Analysis of meteorological measurements made over three rainy seasons in Sinazongwe District, Zambia.

J3.3 DECADAL VARIABILITY OF THE ENSO TELECONNECTION TO THE SOUTH PACIFIC GOVERNED BY COUPLING WITH THE ANTARCTIC OSCILLATION

Transcription:

Inferring long memory processes in the climate network via ordinal pattern analysis Marcelo Barreiro and Arturo C. Marti Instituto de Física, Facultad de Ciencias, Universidad de la República, Iguá 4225, Montevideo, Uruguay Cristina Masoller Departament de Fisica i Enginyeria Nuclear, Universitat Politecnica de Catalunya, Colom 11, E-8222 Terrassa, Barcelona, Spain (Dated: December 19, 217) arxiv:11.1564v3 [physics.ao-ph] 27 Jan 211 We use ordinal patterns and symbolic analysis to construct global climate networks and uncover long and short term memory processes. The data analyzed is the monthly averaged surface air temperature (SAT field) and the results suggest that the time variability of the SAT field is determined by patterns of oscillatory behavior that repeat from time to time, with a periodicity related to intraseasonal oscillations and to El Niño on seasonal-to-interannual time scales. PACS numbers: 5.4.-a, 5.4.Ca, 5.45.Tp, 2.5.-r Keywords: Climate analysis, complex networks, ordinal patterns, symbolic time series We analyze climatological data from a complex networks perspective, using techniques of nonlinear time-series symbolic analysis. Specifically, we employ ordinal patterns and binary representations to analyze monthly-averaged surface air temperature (SAT) anomalies. By computing the mutual information of the time-series in regular grid points covering the Earth s surface and then performing global thresholding, we construct climate networks which uncover short-term memory processes, as well as long ones (5-6 years). Our results suggest that the time variability of the SAT anomalies is determined by patterns of oscillatory behavior that repeat from time to time, with a periodicity related to intraseasonal variations and to El Niño on seasonal to interannual time scales. The present work is located at the triple intersection of three highly active interdisciplinary research fields in nonlinear science: symbolic methods for nonlinear time series analysis, network theory, and nonlinear processes in the earth climate. While a lot of effort is being done in order to improve our understanding of natural complex systems, with many different methods for mapping time series to network representations being investigated and employed in complex systems such as the human brain, our work is the first one aimed at characterizing the global climate network in terms of oscillatory patterns that tend to repeat from time to time, with various time scales. By mapping these processes into a global network, using ordinal patterns and binary representations, we find that the structure of the network changes drastically at different time scales. I. INTRODUCTION Complex networks have been intensively studied in the last years because they represent many real systems such as the Internet, ecological, social and metabolic networks, genes, cells and the brain [1]. Global climate modeling is also a hot topic nowadays because of its huge economic and social impact for future generations. Giving the complexity of the inter-relations between the different elements that constitute our environment, it is important to analyze climatological data from a complex network perspective. However, despite the intensive effort in research done in these two interdisciplinary and fascinating fields, just very few studies have combined both [2 8]. These studies have shown that network theory can yield light into interesting, previously unknown features of our climate. Tsonis and Swanson [4] and Yamasaki, Gozolchiani and Halvin [3] have shown that the climate network is significantly affected by El Niño, as during El Niño years many links of the network are broken. Tsonis and Swanson [4] constructed cross-correlation-based networks of the SAT field for El Niño and for La Niña years and investigated their structure. They found that the El Niño network possesses significantly fewer links and lower clustering coefficient and characteristic path length than the La Niña network. They conjectured and verified that, because El Niño network is less communicative and less stable than La Niña one, during El Niño years temperature predictability is lower compared to La Niña years. Using a different approach, Yamasaki, Gozolchiani and Halvin [3] arrived at a similar conclusion. They developed a method which allows to follow time variations of the network structure by observations of fluctuations in the correlations between nodes. The method allows to distinguish between the two qualitatively different groups of network links, blinking links that appear and disappear in a short time, and robust links that represent long lasting relations between temperature fluctuations in two regions. Assuming that broken links are due to structural

2 changes in the network, by tracking these changes in several zones a strong response to El Niño was reveled, even in geographical regions where the mean temperature is not affected by El Niño. Donges et al. [6] compared the structural properties of networks constructed by using, as a measure of dynamical similarity between regions, linear and nonlinear measures: the linear Pearson correlation coefficient and the nonlinear mutual information. They analyzed two sets of data: the SAT anomalies obtained from large-scale climate simulations by the coupled atmosphere-ocean general circulation models and the SAT anomalies reanalysis data sets. A high degree of similarity using the two approaches (linear and nonlinear similarity measures) was found on the local and on the mesoscopic topological scales; however, important differences were uncovered on the global scale, particularly in the betweenness centrality field. In [7] Donges et al. employed the mutual information to reveal wave-like structures of high-energy flow, that could be traced back to global surface ocean currents. Their results point to the major role of the oceanic surface circulation in coupling and stabilizing the global temperature field in the long-term mean. When computing the mutual information, in order to detect patterns and correlations in the variability of two nodes, a critical issue is defining probability distribution functions (PDFs) that fully take into account the temporal order in which the SAT anomalies occur in the time-series. Histogram-based PDFs do not take into account this temporal order, and thus, are not optimal for capturing subtle correlated oscillatory patterns. Alternatively, one can use time-delay embedding techniques to represent the time series as a trajectory in a highdimensional space; however, the information provided by the mutual information is strongly dependent on the embedding technique, the time-delay, and the phase space partition [9]. An alternative methodology, originally proposed by Bandt and Pompe (BP) [1] allows to define probability distribution functions that fully take into account the time ordering of the SAT anomalies. The BP method is based on comparing values in the time-series to construct ordinal patterns. By computing the PDF of the possible ordinal patterns, various information-theory quantifiers, such as the permutation entropy, the mutual information, complexity measures, etc. can be computed. The BP method has been successfully employed to analyze time-series generated from physical, biological and social systems (see, e.g., [11] and references therein). When employing the BP methodology the precise values of the SAT anomalies are neglected (as the method is based on comparing relative values in the time-series); however, as we will show, with the BP method one can identify patterns of oscillatory behavior that tend to repeat from time to time, with various time scales. A drawback of the BP method is that, in order to capture long memory processes, long time series are needed to compute the PDFs of the ordinal patterns with good statistics. The SAT data available (described in the next section) limited us to construct ordinal patterns of maximum length 5, which allows to consider time-scales up to 5 years or 5 months. To overcome this limitation we employed binary representations, by which the timeseries of SAT anomalies were transformed into sequences of s and 1s. These binary representations allowed to consider processes with longer time-scales, up to 6 years or 6 months. We will show in what follows that ordinal patterns and binary representations are tools that, when employed within a complex network perspective, are very powerful for the analysis of climatological data. By reveling long term and short term memory processes they provide additional information to that obtained from conventional time-series analysis and thus they help to a better understanding of our complex climate. This article is organized as follows: Section II presents the description of the data analyzed and a summary of the methodology employed. Section III presents the results obtained with ordinal patterns and binary representations, and a comparison with the methodologies previously employed by other authors (i.e., the linear crosscorrelation[4] and the nonlinear histogram-based mutual information [6]). Section IV contains a discussion of the results and the conclusions. II. DATA AND METHODS We present the analysis of the monthly averaged surface air temperature (SAT field, reanalysis data from the National Center for Environmental Prediction/National Center for Atmospheric Research, NCEP/NCAR [12]). As in [2, 4, 6, 7], anomaly values are considered (i.e., the actual temperature value minus the monthly average). The data covers a regular grid over the earth s surface with latitudinal and longitudinal resolution of 2.5. These N = 1226 grid points are considered the nodes (or vertices) of a network (or graph), and the existence of a link (or edge) between any two nodes depends on the weight of the link that measures the degree of statistical similarity between the climate dynamics in those two nodes. The data covers the period January 1949-December 26, and therefore in each grid point i (i = 1...N) we have M = 696 data points, {x i (t),t = 1...M}. W = {w ij,i,j = 1...N} is the matrix that contains the weights that characterize the links between any two nodes. Since we don t attempt to uncover directionality in the couplings among the nodes, we will consider a symmetric measure of statistical similarity that results in symmetric weights. In[2, 4] these weights were quantified with the absolute value of the linear cross-correlation coefficient; in [6, 7], with the mutual information, a nonlinear measure that is a function of the probability density functions (PDFs) that characterize the time series in the two nodes, p i (m)

3 and p j (n), as well as of the joint probability, p ij (m,n), W ij = M ij = m,n p ij (m,n)log p ij(m,n) p i (m)p j (n). (1) The mutual information, which can also be written as M ij = S i +S j S ij, (2) where S i = p i logp i, S j = p j logp j and S ij = p ij logp ij, indicates the amount of information of {x i (t)}, we obtain by knowing {x j (t)}, and vice versa. M ij measures the degree of statistical interdependence of the time series; if they are independent, p ij (m,n) = p i (m)p j (n) and M ij =. To uncover correlated patterns of oscillatory behavior in the SAT anomalies, we employ the methodologies referred to as ordinal patterns and symbolic analysis, which are based on comparing consecutive values in the time series, to compute the PDFs in Eq.(1). We begin by presenting the ordinal pattern methodology [1]. First, in each grid point i, the time series {x i (t)} is divided into M D overlapping vectors of dimension D. Then, each element of a vector is replaced by a number from to D 1, in accordance with its relative magnitude in the ordered sequence ( corresponding to the smallest and D 1 to the largest value in each vector). For example, with D = 3 the vector (v,v 1,v 2 ) = (6.8, 11.5, 1.1), gives the ordinal pattern 21 because v 2 < v < v 1. In this way, each vector has associated an ordinal pattern (OP) composed by D symbols, and the symbol sequence comes from a comparison of neighboring values. Last, one computes the PDF of the D! possible ordinal patterns. For example, with D = 3 the 3! = 6 different patterns are (12, 21, 12, 12, 21 and 21), and thus, the PDF is calculated with 6 bins. To have a good statistics one must have M D >> D! (i.e., # of OPs in the time series >> # of possible OPs). Because in each time series we have M = 696 data points, to compute the PDFs with good statistics we limit to consider only D = 4 and D = 5. Ordinal patterns of D 3 do not provide good resolution for computing the mutual information, Eq. 1, because the PDFs are calculated with very few bins (for D = 6, there are only 6 ordinal patterns, and thus, only 6 bins). With climatological data meaningful ordinal patterns can be formed either by comparing consecutive years or consecutive months. Specifically, if we use D = 3, when comparing consecutive years, the OPs in node i are defined by (x i (t),x i (t+12),x i (t+24)), t = 1,...,M 24; when comparing consecutive months, they are defined by (x i (t),x i (t+1),x i (t+2)), t = 1,...,M 2. To decide whether there is a link between two nodes, we perform global thresholding [13], i.e., we define a threshold τ (which is the same for all pairs of nodes) and assume that there is a link between i and j if the weight of the link is above the threshold, i.e., w ij τ. Clearly, a careful selection of the threshold is crucial for uncovering the backbone of the network [6]. We use the followingprocedure: first, we checkthat we only take into account significant network connections. To do this we compute the weight matrix W from randomly shuffled time series in each node. The random elements of this 1226 1226 matrix have a very narrow PDF which, in principle, allows the use of the maximum matrix element, w max, as a significant limit. Then, we compute W with the original time series and consider that there is a significant link between the nodes i and j if w ij > w max, otherwise, we set w ij =. While there are several methods to eliminate non-significant links and the evaluation of statistical significance is still an open problem (see, e.g., the discussion in [6]), this procedure is computationally cost-efficient and we will show in what follows that allows to uncover meaningful climate networks. The drawback is that it is a rather strong test that eliminates weak but significant links, and as a results the networks tend to be very spare. The final step is to chose a threshold τ to select the strongest links. As in [6], we chose τ such that the resulting networks have a pre-determined number of links. In the following we present results for networks that have 1% of the total possible links (which will be referred to as lowthreshold networks) and networks containing.1% of the total possible links (referred to as high-threshold networks). For easy comparison and to visualize the effect of thresholding, we also present the networks containing all the significant links, which will be referred to as the zero-threshold networks. The networks are represented graphically as twodimensional maps by plotting the area-weighted connectivity [2, 4, 6, 7], which is the fraction of the total area of the earth to which each node i is connected, N j AWC i = A ij cos(λ i ) N j cos(λ, (3) j) where λ i is the latitude of node i and A ij = 1 if nodes i and j are connected (i.e., if w ij τ) and otherwise. The cosine terms correct for the fact that in a surface spherical network defined on a regular planar grid, the nodes correspond to regions of different area. III. RESULTS A. Ordinal Patterns Analysis The networks obtained when the ordinal patterns are defined by comparing SAT anomalies in consecutive years and in consecutive months are displayed in Figs. 1-4. In eachpanel the values ofthe threshold, τ, and ofthe edgedensity, are indicated. ρ = N i,j A ij N(N 1), (4)

4 τ = ρ =.27.218 τ =.227 ρ =.1.8 τ =.54 ρ =.1.115 6N 6N.1744 6N.64.92 3N 3N.138 3N.48.69 3S.872 3S.32 3S.46 6S.436 6S.16 6S.23 9E 18E 9W 9E 18E 9W 9E 18E 9W FIG. 1. Zero-threshold (left), low-threshold (center) and high-threshold (right) networks constructed by computing the mutual information from ordinal patterns of length D = 4 defined by comparing SAT anomalies in consecutive years. The 2D plots are color-coded such that the white (red) regions indicate the geographical areas with zero (largest) area weighted connectivity. In each panel the values of the threshold, τ, and of the edge-density, ρ, are indicated. τ = ρ =.6.565 τ =.674 ρ =.1.9 6N 3N 3S 6S 6N.452 3N.339.226 3S.113 6S.72.54.36.18 9E 18E 9W 9E 18E 9W FIG. 2. Zero-threshold (left) and high-threshold (right) networks constructed by computing the mutual information from ordinal patterns of length D = 5 defined by comparing SAT anomalies in consecutive years. The 2D plots are color-coded such that the white (red) regions indicate the geographical areas with zero (largest) area weighted connectivity. The weaker links lose a bit of memory (compare the zero-threshold networks with D = 4 and D = 5) while the strong links do not, as the high-threshold networks are the nearly same for D = 4 and D = 5. For consecutive years the networks with D = 4, Fig. 1, and D = 5, Fig. 2 are very similar showing highest connectivity for the tropical region. For zero-threshold (left panels in Figs. 1 and 2) the tropical Pacific shows the largest connectivity, particularly in the central and eastern side of the basin; the tropical Atlantic and Indian oceans follow. In the extratropics there are patches of high connectivity off the western coast of Canada in the Northern Hemisphere (N.H.) and in the south Pacific in the Southern Hemisphere (S.H.). This connectivity structure is more pronounced for D = 5, although some of the weak links lose memory. These characteristics hint to El Niño as a fundamental player in setting up these connections [14]. The El Niño phenomenon occurs on interannual time scales and consists in an anomalous warming of the eastern equatorial Pacific. This warming in turn warms up the local atmosphere and influences other tropical regions through the excitation of equatorial Kelvin and Rossby waves. Changes in the precipitation associated with El Niño also induce stationary Rossby waves in the northern and southern extratropics that generate long range connections called atmospheric teleconnection patterns. Examples of these structures are the Pacific-North American pattern that affects the northern Pacific and North America, and the Pacific-South American pattern that propagates in the southern Pacific toward South America. These anomalous structures connect the tropical Pacific with remote locations and affect the local climate by changing, for example, the advection of heat or moisture into a region. For non-zero-thresholds (center and right panels in Figs. 1 and 2) only the strongest links remain and the networks clearly show again an El Niño-like structure in the tropical Pacific. Secondary maxima in the Indian and Atlantic oceans are present in the low-threshold networks but are significantly weakened in the high-threshold ones. The continental regions have overall very low connec-

5 τ = ρ =.18.41 τ =.186 ρ =.1.215 τ =.476 ρ =.1.15 6N 6N.328 6N.172.12 3N 3N.246 3N.129.9 3S.164 3S.86 3S.6 6S.82 6S.43 6S.3 9E 18E 9W 9E 18E 9W 9E 18E 9W FIG. 3. As Fig. 1 but D = 4 ordinal patterns defined by comparing SAT anomalies in consecutive months. The 2D plots of the area weighted connectivity are color-coded such that the white (red) regions indicate the geographical areas with zero (largest) area weighted connectivity. τ = ρ =.4.7 τ =.656 ρ =.1.15 6N 3N 3S 6S 6N.56 3N.42.28 3S.14 6S.12.9.6.3 9E 18E 9W 9E 18E 9W FIG. 4. Zero-threshold (left) and high-threshold (right) networks constructed by computing the mutual information from ordinal patterns of length D = 5 defined by comparing SAT anomalies in consecutive months. The 2D plots of the area weighted connectivity are color-coded such that the white (red) regions indicate the geographical areas with zero (largest) area weighted connectivity. tivity which translates in the low predictability of surface temperature anomalies on interannual time scales. Within these continental regions, the largest connectivity is seen over Asia and North America, the latter maximum being perhaps due to the Pacific North American pattern induced by El Niño [15]. The networks obtained when the ordinal patterns are defined by comparing temperature anomalies in consecutivemonths, Figs. 3, 4, present, forzero-andlow-threshold, similar features as for consecutive years, although the networks are more homogeneous. There is a maximum in the equatorial Pacific, a secondary maximum in the Indian ocean and extratropical maxima over Asia, North America and southern subtropics. On the other hand, the high threshold network shows that the strongest links are located in the extra tropics. We speculate that this could be a result of the modulation of the temperature variance by the seasonal cycle, which is strongest over the northern hemisphere continental masses and has a minimum in the tropical band. This network structure is also seen when using ordinal patterns formed by 5 consecutive months, Fig. 4. B. Binary representations To capture longer memory processes one should use larger D values; however, for D = 6 there are 6! = 72 possible ordinal patterns, and since we have time series with less than 7 data points, there is not enough data to calculate ordinal patterns PDFs with good statistics. As discussed in the introduction, a solution to overcome this problem is employing binary representations, by which the time series {x i (t)} is transformed into a sequence {v i (t)} of s and 1s, using the following rule: v i (t) = if x i (t) and v i (t) = 1 otherwise (since the x i values are temperature anomalies, we are taking into account whether the SAT field is above or below its

6 τ = ρ =.41.375 τ =.218 ρ =.1.85 τ =.512 ρ =.1.9 6N 6N.246 6N.644.72 3N 3N.1845 3N.483.54 3S.123 3S.322 3S.36 6S.615 6S.161 6S.18 9E 18E 9W 9E 18E 9W 9E 18E 9W FIG. 5. As Fig. 1 but employing binary representation. Zero-threshold (left), low-threshold (center) and high-threshold (right) networks constructed by computing the mutual information from patterns of length D = 4 defined by comparing SAT anomalies in consecutive years. The 2D plots are color-coded such that the white (red) regions indicate the geographical areas with zero (largest) area weighted connectivity. In each panel the values of the threshold, τ, and of the edge-density, ρ, are indicated. τ = ρ =.21.25 τ =.331 ρ =.1.885 τ =.566 ρ =.1.1 6N 6N.164 6N.78.8 3N 3N.123 3N.531.6 3S.82 3S.354 3S.4 6S.41 6S.177 6S.2 9E 18E 9W 9E 18E 9W 9E 18E 9W FIG. 6. As Fig. 5 but with D = 5. monthly averaged value). We can then define binary patterns of dimension D (e.g., for D = 3 the possible patters are, 1, 1, 1, 11, 11, 11 and 111) and compute their PDF. The number of different patterns is 2 D, and thus, we can calculate PDFs of patterns of D = 6 (2 6 = 64) with good enough statistics. Patterns with D 3 do not providegoodresolutionfor computing the mutual information, Eq. 1, because the PDFs are calculated with very few bins. Therefore, in the following, we consider D = 4, 5, and 6. Figures 5-1 present the results when the binary patterns are defined by consecutive years and months. For consecutive years, Figs. 5-7, the networks obtained when using binary representations are very similar to those found with ordinal patterns. The tropical regions are quite uniformly well connected (although a Pacific maximum is clear) while the extratropics show localized regions of high connectivity likely due to atmospheric teleconnections forced from the tropics, particularly for low density networks. The networks obtained for consecutive months, Figs. 8-1, show that as the threshold or as D increases there are overall similar changes in structure as those seen for ordinal patterns, Figs. 3, 4: for short memory processes (or for low threshold) the maximum connectivity is in the tropics, while for longer time scales (or for higher threshold) the extratropics show largest number of links. As discussed before, this could be the result of the modulation of the temperature variance by the seasonal cycle, which would be the main process that connects grid points in very low density networks (grid points that are connected by very strong links). Our results could also hint at the roleofland surfaceconditions like snoworsoil humidity in increasing the persistence of surface temperature anomalies over the northern hemisphere continents. Overall, these results agree well with the fact that temperature teleconnections from the tropical Pacific tend to last no much longer than a season in the different parts of the world. As a way to test the interpretation of the above presented results, in terms of the symbolic methodology of time-series analysis capturing two different time-scales of the Earth s climate, seasonal and interannual, we constructed binary patterns of fixed dimension, D = 6, that cover three different time intervals:

7 τ = ρ =.8.88 τ =.645 ρ =.1.95 6N 3N 3S 6S 6N.74 3N.528.352 3S.176 6S.76.57.38.19 9E 18E 9W 9E 18E 9W FIG. 7. As Fig. 5 but with D = 6. Comparing with Figs. 5 and 6 one can see that there is a good agreement with the results obtained previously with ordinal patterns: the weaker links tend to lose memory, while the strongest links do not. τ = ρ =.22.114 τ =.186 ρ =.1.475 τ =.481 ρ =.1.15 6N 6N.912 6N.38.12 3N 3N.684 3N.285.9 3S.456 3S.19 3S.6 6S.228 6S.95 6S.3 9E 18E 9W 9E 18E 9W 9E 18E 9W FIG. 8. As Fig.5 but the networks constructed with binary representation comparing anomalies in D = 4 consecutive months. The 2D plots of the area weighted connectivity are color-coded such that the white (red) regions indicate the geographical areas with zero (largest) area weighted connectivity. i) covering one-year, the patterns are composed as [x i (t),x i (t+2),x i (t+4),...,x i (t+1)]; ii) covering two-years, the patterns are composed as [x i (t),x i (t+4),x i (t+8),...,x i (t+2)]; ii) covering three-years, the patterns are composed as [x i (t),x i (t+6),x i (t+12),...,x i (t+3)]. The results are presented in Fig.11, were one can see how the network changes. For a time interval of one year the extra tropics have the largest number of links and there are very few in the tropical region. On the other hand, for a time interval of three years, the extra-tropics keep about the same number of connections while in tropical Pacific El Niño stands out (note the different color scales in the panels in Fig.11). In summary, confirming the results previously found with ordinal patterns and binary representations composed by consecutive years and by consecutive months, on intra-seasonal time scales the extra tropical connections dominate, while on interannual scales, the El Niño is the key player in setting up teleconnections worldwide. As it was previously discussed, the significance of the network links was tested in comparison with links computed from surrogate time-series in each node. Since surrogate data does not preserve the autocorrelation properties of the original time-series, to further test the validity of the previously presented results, we did the following test: we computed the mutual information using the original time series in node i and the time-inverted series in node j. The results show no significant spatial structure in the area weighted connectivity plots (not shown). C. Comparison with other measures It is interesting to compare the results obtained using ordinal patterns and binary representations with those obtained using conventional techniques of timeseries analysis, as the linear cross-correlation coefficient (as in Ref. [4]) and the mutual information, computing

8 τ = ρ =.12.245 τ =.533 ρ =.1.15 6N 3N 3S 6S 6N.196 3N.147.98 3S.49 6S.12.9.6.3 9E 18E 9W 9E 18E 9W FIG. 9. As Fig. 8 but with D = 5. τ = ρ =.5.95 τ =.68 ρ =.1.15 6N 3N 3S 6S 6N.76 3N.57.38 3S.19 6S.12.9.6.3 9E 18E 9W 9E 18E 9W FIG. 1. As Fig. 8 but with D = 6. the PDFs from standard histograms of amplitude values (as in Refs. [6, 7]). Figure 12 displays the zero, low and high threshold networks when the weights are calculated with the absolute value of the cross-correlation coefficient coefficient and Fig. 13, when they are calculated with mutual information, with the PDFs calculated from histograms of temperature anomaly values. In this case the PDFs were computed employing 32 bins and in each time-series the values of the SAT anomalies were re-normalized such that each time-series has zero mean and standard deviation equal to one (as in Ref. [6, 7], for easier comparison). The 2D plots of the area weighted connectivity are similar to those previously reported in Ref. [6] and also, to those seen in Figs. 1, 2, 5, 6, where the ordinal patterns and the binary representations are formed by comparing consecutive years. El Niño is the main feature uncovered. There are also regions with relatively high number of links in the northern hemisphere continents and southern subtropics, but the high connectivity in the extratropics seen previously in Figs. 3, 4, 8, 9, and 1 is not observed. In other words, employing the cross correlation coefficient or the histogram-based mutual information uncovers mainly the interannual network. These methodologies fail to separate the two distinct time-scales (intra-seasonal and inter-annual) that are clearly seen when using symbolic analysis and the time series are transformed in sequences of patterns by comparing consecutive years or consecutive months. IV. CONCLUSIONS Concluding, we have shown that ordinal patterns and symbolic analysis applied to anomalies of the surface air temperature are powerful tools for the analysis of the large-scale topology of the climate network. The success of these methods is based on an appropriate partition of the phase space that results in ordinal patterns and binary representations having PDFs that characterize the diversity of patterns present in the climate dynamics. A main advantage of the methodology proposed here is that by varying the dimension of the pattern and the year-month comparison, one can uncover memory processes with different time scales. We found that both, monthly and yearly patterns reveal long memory pro-

9 τ =.62 ρ =.1.15 τ =.625 ρ =.1.15 τ =.635 ρ =.1.45 6N 6N.12 6N.12.36 3N 3N.9 3N.9.27 3S.6 3S.6 3S.18 6S.3 6S.3 6S.9 9E 18E 9W 9E 18E 9W 9E 18E 9W FIG. 11. High-threshold networks constructed with binary representations, with patterns of D = 6 covering time-intervals of one year (left), two years (center) and three years (right). The 2D plots of the area weighted connectivity are color-coded such that the white (red) regions indicate the geographical areas with zero (largest) area weighted connectivity. See text for details. τ = ρ =.95.433 τ =.643 ρ =.1.66 τ =.932 ρ =.1.6 6N 6N.3464 6N.528.48 3N 3N.2598 3N.396.36 3S.1732 3S.264 3S.24 6S.866 6S.132 6S.12 9E 18E 9W 9E 18E 9W 9E 18E 9W FIG. 12. Zero-threshold network(left) and non-zero-threshold networks (center and right) constructed by estimating the weights with the absolute value of the cross-correlation coefficient. The 2D plots of the area weighted connectivity are color-coded such that the white (red) regions indicate the geographical areas with zero (largest) area weighted connectivity. cesses, and that depending on the time scale considered the climate network can change completely. The fact that ordinal patterns and symbolic analysis give meaningful information indicates that the time variability of the anomaly SAT field is strongly determined by patterns of oscillatory behavior that tend to repeat from time to time. Overall we found that on seasonal time-scales the extratropical regions, mainly over Asia and North America, present the strongest links while in interannual time scales, the tropical Pacific clearly dominates. The authors thank the two anonymous referees for their very useful comments and suggestions. M.B. acknowledges support from the European Community s Seventh Framework Programme (FP7/27-213) under Grant Agreement N 212492 (CLARIS LPB. A Europe- South America Network for Climate Change Assessment and Impact Studies in La Plata Basin). C.M. acknowledges support from the ICREA Academia programme, the Ministerio de Ciencia e Innovación, Spain, project FIS29-1336 and the Agencia de Gestio d Ajuts Universitaris i de Recerca (AGAUR), Generalitat de Catalunya, through project 29 SGR 1168. ACKNOWLEDGMENTS [1] A. Arenas, A. Diaz-Guilera, J. Kurths, Y. Moreno and C. Zhou, Synchronization in complex networks, Phys. Rep. 469, 93-153 (28).

1 τ = ρ =.39.2145 τ =.167 ρ =.1.695 τ =.456 ρ =.1.7 6N 6N.1716 6N.556.56 3N 3N.1287 3N.417.42 3S.858 3S.278 3S.28 6S.429 6S.139 6S.14 9E 18E 9W 9E 18E 9W 9E 18E 9W FIG. 13. Zero-threshold network(left) and non-zero-threshold networks (center and right) constructed by estimating the weights with the mutual information, calculating the PDFs from histograms of SAT anomalies. The 2D plots of the area weighted connectivity are color-coded such that the white (red) regions indicate the geographical areas with zero (largest) area weighted connectivity. [2] A. A. Tsonis, K. L. Swanson and P. J. Roebber, What do networks have to do with climate? Bull. Amer. Meteorol. Soc. 87, 585 (26). [3] K. Yamasaki, A. Gozolchiani and S. Halvin, Climate Networks around the Globe are significantly affected by El Niño, Phys. Rev. Lett. 1, 22851 (28). [4] A. A. Tsonis and K. L. Swanson, Topology and Predictability of El Niño and La Niña Networks, Phys. Rev. Lett. 1, 22852 (28). [5] A. A. Tsonis, K. L. Swanson and G. Wang, On the role of atmospheric teleconnections in climate, J. Climate 21, 299 (28). [6] J. F. Donges, Y. Zou, N. Marwan et al., Complex networks in climate dynamics, Eur. Phys. J. Spec. Top. 174 157 (29). [7] J. F. Donges, Y. Zou, N. Marwan et al., The backbone of the climate network, Eur. Phys. Let. 87 487 (29). [8] S. Bialonski, M. T. Horstmann, and K. Lehnertz, From brain to earth and climate systems: Small-world interaction networks or not?, Chaos 2 13134 (21). [9] R. Q. Quiroga, A. Kraskov, T. Kreuz and P. Grassberger, Performance of different synchronization measures in real data: A case study on electroencephalographic signals, Phys. Rev. E 65, 4193 (22). [1] C. Bandt and B. Pompe, Permutation entropy: A natural complexity measure for time series, Phys. Rev. Lett. 88, 17412 (22). [11] J. M. Amigo, Permutation complexity in dynamical systems: ordinal patterns, permutation entropy and all that Springer, Berlin, Germany (21). [12] E. Kalnay et al. The NCEP/NCAR 4-Year Reanalysis Project, Bulletin of the American Meteorological Society 77 (3): 437471. [13] V. M. Eguiluz et. al, Scale-free brain functional networks, Phys. Rev. Lett. 94, 1812 (25). [14] K. E. Trenberth, G. W. Branstator, D. Karoly, A. Kumar, N-C. Lau, and C. Ropelewski, Progress during TOGA in understanding and modeling global teleconnections associated with tropical sea surface temperatures, J. Geophys. Res. 13 14291-14324 (1998). [15] C. F. Ropelewski and M. S. Halpert, North American precipitation and temperature patterns associated with the El Nio/Southern Oscillation (ENSO), Mon. Wea. Rev. 114 23522362 (1986).