Big Data and Macroeconomic Nowcasting

Size: px
Start display at page:

Download "Big Data and Macroeconomic Nowcasting"

Transcription

1 Big Data and Macroeconomic Nowcasting George Kapetanios, Massimiliano Marcellino & Fotis Papailias King s College London, UK Bocconi University, IT Queen s Management School, UK Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

2 Outline Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

3 Outline Introduction Types of Big Data Pros and Cons of Big Data for Macroeconomic Nowcasting Data & Methodological Issues Part 1: Literature Review Existing literature on the use of Big Data Existing literature on Macroeconomic Nowcasting What about their combination? Part 2: Big Data Modelling Machine Learning Heuristic Optimisation Dimensionality Reduction Forecast Combination & Model Averaging Mixed Frequency Methods for Big Data Big Data Access, Cleaning and Preparation Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

4 Outline Part 3: Empirical Analysis Data Description Nowcasting & Short-Term Forecasting Evaluation Conclusions Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

5 Introduction Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

6 Introduction The recent crisis has emphasized the importance for policy-makers and economic agents of a real-time assessment of the current state of the economy and its expected developments, when a large but incomplete information set is available. The main obstacle is the delay with which key macroeconomic indicators such as GDP and its components, but also fiscal variables, regional/sectoral indicators and disaggregate data, are released. Example: GDP data are only available on a quarterly basis and the advance/flash estimate is only published with a 4-6 week delay or longer, depending on the country. Moreover, preliminary data are often revised afterwards, in particular around turning points of the business cycle. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

7 Introduction On the other hand, a large and growing number of more timely leading and coincident indicators is available, at a monthly, daily or even higher frequency, based in particular on financial, survey and internet data, though sometimes subject to short samples, missing observations and other data irregularities. This has stimulated a vast amount of statistical and econometric research on how to exploit the large, timely and higher frequency but irregular information to provide estimates for key low frequency economic indicators. A parallel, more empirical, literature has instead focused specifically on the use of big data for nowcasting economic indicators, often using rather simple econometric techniques and specific big data sources, mainly Google Trends. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

8 Introduction Finally, a more theoretical literature has developed new, or adapted old, statistical and econometric methods to handle very large sets of explanatory variables, such as those associated with big data. Broadly speaking, they are based either on summarizing the many variables, or on selecting them, or on combining many small models, after a proper data pre-treatment. Some techniques are borrowed from machine learning, where prediction is of key interest but data are typically assumed to be i.i.d. rather than serially correlated and possibly with changing variances over time, so that these techniques have to be properly extended prior to use on economic data. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

9 Introduction: Big Data Types The literature provides various definitions of big data. Part of the problem is that the meaning of big data can differ significantly across disciplines. One possibility to obtain a general classification is to adopt the 4 Vs classification, originated by the IBM, which relates to: (i) Volume (Scale of data), (ii) Velocity (Analysis of streaming data), (iii) Variety (Different forms of data) and (iv) Veracity (Uncertainty of data). However, this classification seems too general to guide empirical nowcasting applications. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

10 Introduction: Big Data Types A second option is to focus on numerical data only, which can either be the original big data or the result of a transformation of unstructured data, and refer to the size of the dataset. Unstructured data such as, e.g., credit card transactions or other data that describe the disaggregated actions or characteristics of many agents, are likely to need to be transformed to a two dimensional panel structure where time is usually one dimension. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

11 Introduction: Big Data Types Following, e.g. Doornik and Hendry (2015), we can distinguish three main types of big data: 1 Tall 2 Fat 3 Huge Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

12 Introduction: Big Data Types Tall Tall : not so many variables, N, but many observations, T, with T N. This is for example the case with tick by tick data on selected financial transactions or search queries. In this case T is indeed very large in the original time scale, say seconds, but it should be considered whether it is also large enough in the time scale of the target macroeconomic variable of the nowcasting exercise, say quarters. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

13 Introduction: Big Data Types Tall Tall datasets at very high frequency are not easily obtained, as they are generally owned by private companies. Moreover, they generally require substantial pre-treatment, as indicators typically present particular temporal structures (related, e.g., to market micro-structure) and other types of irregularities, such as outliers, jumps and missing observations. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

14 Introduction: Big Data Types Fat Fat : many variables, but not so many observations, N T. Large cross-sectional databases fall into this category, which is not so interesting from an economic nowcasting point of view, unless either T is also large enough or the variables are homogeneous enough to allow proper model estimation (e.g., by means of panel methods) and nowcast evaluation. However, Fat datasets can be of interest in many other applications of big data, both inside official statistics, e.g., for surveys construction, and outside, e.g., in marketing or medical studies. Also, as the collection of big data started only rather recently, Fat datasets are perhaps the most commonly available type. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

15 Introduction: Big Data Types Fat Actually, statistical methods for big data are mainly meant for Fat datasets, e.g., those developed in the machine learning literature, as they only require a large cross-section of i.i.d. variables. When a (limited) temporal dimension is also present, panel estimation methods are typically adopted in the economic literature, but factor based methods can be also applied. Classical estimation methods are not so suited, as their finite (T) sample properties are generally hardly known, while Bayesian estimation seems more promising, as it can easily handle a fixed T sample size and, with proper priors, also a large cross-sectional dimension. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

16 Introduction: Big Data Types Huge Huge : many variables and many observations, i.e., very large N and T. This is perhaps the most interesting type of data in a nowcasting context even if, unfortunately, it is not so often available. Big data collection started only recently, while collection of the target economic indicators started long ago, generally as far back as the 1950s or 1960s for many developed countries. Google Trends, publicly available summaries of a huge number of specific search queries in Google, are perhaps the best example in this category, and not by chance the most commonly used indicators in economic nowcasting exercises. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

17 Introduction: Big Data Types Huge Contrary to basic econometrics and statistics, in Huge datasets both T and N diverge, and proper techniques must be adopted to take this feature into account at the level of model specification, estimation and evaluation. For example, in principle it is still possible to consider information criteria such as BIC or AIC for model specification (indicator selection for the target variable in the nowcasting equation), although it is the case that modifications may be needed to account for the fact that N is comparable or larger than T, as opposed to much smaller as assumed in the derivations of information criteria. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

18 Introduction: Big Data Types Huge Further, in the case of a linear regression model with N regressors, 2 N alternative models should be compared, which is not computationally feasible when N is very large, so that efficient algorithms that only search specific subsets of the 2 N possible models have been developed. Moreover, the standard properties of the OLS estimator in the regression model are derived assuming that N is fixed and (much) smaller than T. Some properties are preserved, under certain conditions, also when N diverges but empirically the OLS estimator does not perform well due to collinearity problems that require a proper regularization of the second moment matrix of the regressors. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

19 Introduction: Big Data Types Huge This is in turn relevant for nowcasting, as the parameter estimators are used to construct the nowcast (or forecast). As a result of these problems for OLS, a number of regularisation, or penalisation methods have been suggested. An early approach, referred to as Ridge regression, uses shrinkage to ensure a well behaved regressor sample second moment matrix. More recently, other penalisation methods have been developed. A prominent example is LASSO where a penalty is added to the OLS objective function in the form of the sum of the absolute values of the coefficients. Many related penalisation methods have since been proposed and analysed. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

20 Introduction: Big Data Types Huge As an alternative to variable selection, the indicators could be summarized by means of principal components (or estimated factors) or related methods such as dynamic principal components or partial least squares. However, standard principal component analysis is also problematic when N gets very large, but fixes are available, such as the use of sparse principal component analysis. Finally, rather than selecting or summarizing the indicators, they could be all inserted in the nowcasting regression but imposing tight priors on their associated coefficients, which leads to specific Bayesian estimators. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

21 Introduction: Big Data Pros/Cons In a nowcasting context, we think of big data as complements rather than substitutes for more common coincident and leading indicators. Some of the problems discussed for internet based big data also apply to large datasets of conventional indicators. For example, collecting disaggregated macroeconomic and financial variables for an EU country easily leads to a few hundred indicators, and multiplying that for all the EU countries, e.g. in order to conduct a comparative analysis, leads to thousands of variables to be considered in formal econometric models, which can be hardly done with standard techniques. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

22 Introduction: Big Data Issues Data Availability. Most data pass through private providers and are related to personal aspects. Hence, continuity of data provision could not be guaranteed. For example, Google could stop providing Google Trends, or at least no longer make them available for free. Or online retail stores could forbid access to their websites to crawlers for automatic price collection. Or individuals could extend the use of softwares that prevent tracking their internet activities, or tracking could be more tightly regulated by law for privacy reasons. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

23 Introduction: Big Data Issues Digital Divide. The fact that a sizable fraction of the population still has no or limited internet access. This implies that the available data are subject to a sample selection bias, and this can matter for their use. Suppose, for example, that we want to nowcast unemployment at a disaggregate level, either by age or by regions. Internet data relative to older people or people resident in poorer regions could lead to underestimation of their unemployment level, as they have relatively little access to internet based search tools. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

24 Introduction: Big Data Issues Changing Size & Quality. A third issue is that both the size and the quality of internet data keeps changing over time, in general much faster than for standard data collection. For example, applications such as Twitter or WhatsApp were not available just a few years ago, and the number of their users increased exponentially, in particular in the first period after their introduction. Similarly, other applications can be gradually dismissed or used for different uses. For example, the fraction of goods sold by EBay through proper auctions is progressively declining over time, being replaced by other price formation mechanisms. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

25 Introduction: Big Data Issues Bias in Answers. Again more relevant for digital than standard data collection, is that individuals or businesses could not report truthfully their experiences, assessments and opinions. For example, some newspapers and other sites conduct online surveys about the feelings of their readers (happy, tired, angry, etc.) and one could think of using them, for example, to predict election outcomes, as a large fraction of happy people should be good for the ruling political party. But, if respondents are biased, the prediction could be also biased, and a large fraction of non-respondents could lead to substantial uncertainty. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

26 Introduction: Big Data Issues Data Format. A fifth issue is that data could not be available in a numerical format, or not in a directly usable numerical format. A similar issue emerges with standard surveys, for example on economic conditions, where discrete answers from a large number of respondents have to be somewhat summarized and transformed into a continuous index. However, the problem is more common and relevant with internet data. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

27 Introduction: Big Data Issues Irregularities. A final issue, again common also with standard data but more pervasive in internet data due to their high sampling frequency and broad collection set, relates to data irregularities: outliers, working days effects, missing observations, presence of seasonal / periodic patterns, etc. all of which require properly de-noising and smoothing the data. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

28 Introduction: Big Data Advantages Big data provide potentially relevant complementary information with respect to standard data, being based on rather different information sets. Moreover, they are timely available and, generally, they are not subject to subsequent revisions, all relevant features for potential coincident and leading indicators of economic activity. Finally, they could be helpful to provide a more granular perspective on the indicator of interest, both in the temporal and in the cross-sectional dimensions. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

29 Introduction: Big Data Advantages In the temporal dimension, they can used to update nowcasts at a given frequency, such as weekly or even daily, so that the policy and decision makers can promptly update their actions according to the new and more precise estimates. In the cross-sectional dimension, big data could provide relevant information on units, such as regions or sectors, not fully covered by traditional coincident and leading indicators. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

30 Introduction: Summary First, do we get any relevant insights? nowcast precision by using big data? In other words, can we improve Second, do we get a big data hubris? Again as anticipated, we think of big data based indicators as complements to existing soft and hard data-based indicators, and therefore we do not get a big data hubris. Third, do we risk false positives? Namely, can we get some big data based indicators that nowcast well just due to data snooping? Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

31 Introduction: Issues Summary Fourth, do we mistake correlations for causes? Fifth, do we use the proper econometric methods? Sixth, do we have instability due to Algorithm Dynamics or other causes (e.g., the financial crisis, more general institutional changes, the increasing use of internet, discontinuity in data provision, etc.)? Finally, do we allow for variable and model uncertainty? Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

32 Part 1: Literature Review Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

33 Literature Review We review the most important research papers on big data in three distinct areas: Big Data in macroeconomics, Variable selection and dimensional reduction for big data in macroeconomics, Nowcasting in macroeconomics. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

34 Literature Review The aim of the review is to try to answer the following questions: 1 What are possible big data sources in relation to the list of Eurostat s Unit C1 macroeconomic indicators (i.e. GDP, inflation and producer prices, employment and unemployment, industrial production index and retail trade deflated turnover)? 2 What are the advantages and disadvantages of the previously analysed sources? 3 What are the main types of statistical methods used in the big data in macroeconomics literature? 4 What are the possible gains generated either by the use of big data or new statistical methods or both in comparison with existing practices in the field of nowcasting? Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

35 Literature Review Based on the surveyed papers, we can say that the use of Google Trends has been dominant in studies using big data in macroeconomics. There exist some papers based on Twitter data, also reviewed, but they are mainly in finance. Webscrapping and collection of online prices also offer some potential, especially for nowcasting inflation. However, such datasets are very difficult to obtain (and possibly sustain), even more so when many countries and long enough samples are required. A similar comment applies for credit card and financial transactions data, and for data summaries resulting from textual analysis. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

36 Literature Review From the literature it also emerges that the advantages of using data like Google Trends are: more timely forecasts, not subject to data revision, some improvements in forecast accuracy, even though these typically emerge with respect to simple benchmarks (AR models), ease of data access and collection, ease of data management and treatment, expected good data quality, reasonable likelihood that similar data, will be available on a continuous basis and without major definitional changes. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

37 Literature Review There also some disadvantages when using this data source, the main ones being: a typical sole use of such data, which can lead to biased results ( big data hubris ), the impossibility to access the raw data, and the lack of knowledge of the precise algorithms used to pre-treat and summarize them, the possibility that free access will be discontinued by the (private) data provider, or limited due to the introduction of more stringent privacy laws. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

38 Literature Review Varian (2014) provides an intuitive introduction in big data management and manipulation. Big data, possibly after transformation to some kind of numerical format, has to be stored in some sort of database, as it is difficult to be dealt with spreadsheets. A medium sized dataset (i.e. about a million observations) could be stored and manipulated using a relational database, such as MySQL, whereas a dataset of several million observations could be efficiently stored and manipulated by NoSQL databases. Sometimes, and depending on the nature of the research, a carefully selected subsample or summary of the data might be sufficient for analysis. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

39 Literature Review Hence, big data creation, storage and management typically require specific IT skills, software and hardware. The associated costs should be kept into consideration when assessing the potential benefits of big data for nowcasting. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

40 Literature Review In terms of statistics and econometrics, data analysis is typically broken down into four categories: pre-treatment and summarisation, estimation, hypothesis testing and prediction. Since a large amount of data is available, penalised regressions such as LASSO, LARS, and elastic nets can be used instead of the standard linear or logistic regression. These techniques could also be used for variable selection. Then, the choice of the final model should come from forecasting crossvalidation so that the researcher makes sure the model has good out-ofsample predictive ability. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

41 Part 1: Literature Review Main Findings Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

42 Literature Review: Main Findings The purpose of the literature review is to answer four questions: 1 What are possible big data sources in relation to the list of Eurostat s Unit C1 macroeconomic indicators? 2 What are the advantages and disadvantages for each of the previously analysed sources? 3 What are the main types of statistical methods used in the big data in macroeconomics literature? 4 What are the possible gains generated either by the use of big data or new statistical methods or both in comparison with existing practices in the field of nowcasting? Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

43 Literature Review: Main Findings As anticipated in the Introduction, after a careful examination of the most important papers in each area we can say that the majority of big data papers are based on Google Trends as predictors. The advantages of using data like Google Trends include: a the improved timeliness of the forecasts without need for data revision, b the potential improvement of forecasts, c open access to the data, d easy data handling, e good data quality, f reasonable possibility that this sort of data will be released on a continuous basis. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

44 Literature Review: Main Findings However, one must be cautious when using data of this type as it is often associated with the following issues: a the use of Google data as the only data input could lead to biased results (commonly known as big data hubris ), b the restrictions to access the raw data but only the Google index, c the possibility that free access will be discontinued. Therefore, we suggest the use of Google Trends for nowcasting the macroeconomic variables of interest for Eurostat. We strongly believe that such data must be used as a supplement to current forecasting tools and not as a substitute. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

45 Literature Review: Main Findings Regarding questions (3) and (4), it turns out that most of the papers in the literature generate nowcasts based on mixed frequency versions of linear regressions, VARs, (dynamic) factor models or a combination of them, and adopt various strategies for variable selection in the presence of a large set of potential regressors. While no clear-cut ranking of the alternative methodologies emerge, there seems to be consensus about the usefulness of big data for nowcasting variables such as unemployment, GDP, inflation and surveys, even though the gains are often computed with respect to (too) simple benchmarks. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

46 Part 2: Big Data Modelling Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

47 Big Data Modelling Let y t, t = 1,..., T, be the target variable and x t = (x 1t,..., x Nt ) be a set of potential predictors, with N very large. We do not assume a particular data generating process for y t but simply posit the existence of a representation of the form which implies that E(u t x 1t,..., x Nt ) = 0. y t = a + g(x 1t,..., x Nt ) + u t, (1) While the potential nonlinearity in (1) might, in principle, be worth exploring, it is extremely difficult to model nonlinearities in the context of large datasets and no work is available on this in the big data literature. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

48 Big Data Modelling As a result, we consider an approximating linear representation of the form, y t = a + N β i x it + u t, (2) i=1 with u t denoting a martingale difference process and where the set of x it s can also contain products of the original indicators in order to provide a better approximation to (1). Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

49 Big Data Modelling Our main aim is to provide estimates for current and future values of y t, where either no, or only a preliminary, value for y t is available from official statistics. To do so, we can rely on many approaches, which can be categorised in three main strands. The first strand aims to provide estimates for β = (β 1,..., β N ). While ordinary least squares (OLS) is the benchmark method for doing so, it is clear that if N is large this is not optimal or even feasible (when N > T ). Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

50 Big Data Modelling So other methods need to be used. We consider two classes of methods. The first one is sparse regression, with origins in the machine learning literature. A main aim there is to stabilise the variability of the estimated β i. The second class considers the use of a variety of information criteria such as AIC or BIC to select a smaller subset of all the available predictors. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

51 Big Data Modelling The second strand consists of reducing the dimension of x t by producing a much smaller set of generated regressors, which can then be used to produce nowcasts and forecasts in standard ways. The third strand suggests the use of a (possibly very large) set of small models, one for each available indicator or small subset of them, and then the combination of the resulting many nowcasts or forecasts. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

52 Part 2: Big Data Modelling Main Findings Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

53 Big Data Modelling: Main Findings We feel that Multiple Testing (related to Boosting) and, potentially, some variant of LASSO could be the preferred approaches among the set of machine learning techniques. Among the data reduction techniques, PCA and, possibly PLS are promising. And it would be also worthwhile experimenting with Bayesian regression, with substantial shrinkage, and forecast combination, with simple equal weighting. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

54 Big Data Modelling: Main Findings Finally, all these approaches should be also modified to take into account the possibility of a different timing for the target and the indicator variables. We have surveyed a number of alternative methods to handle mixed frequencies, and it turns out that Unrestricted MIDAS or bridge modelling appear as the most promising approaches, as they preserve linearity and do not add an additional layer of computational complexity. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

55 Part 3: Empirical Analysis Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

56 Empirical Analysis: Introduction We consider nowcasting and short-term (one- to twelve- months ahead) forecasting three key macroeconomic variables: 1 inflation (measured by the growth rate in the Consumer Price Index), 2 growth in retail sales (measured by the growth rate of the Retail Trade Index), and 3 the Unemployment Rate. The exercise is conducted recursively in a pseudo out of sample framework, using monthly data for three economies: Germany (DE), Italy (IT) and the UK. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

57 Empirical Analysis: Introduction We assess the relative performance of: Big Data (proxied via weekly Google Trends) and standard indicators (based on a large set of weekly and monthly economic and financial variables) In fact, as we have mentioned several times, we think of Big Data as providing complementary information, and we wish to assess how useful it is in a forecasting context relative to standard indicators. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

58 Empirical Analysis: Introduction We also evaluate the role of several econometric methods and alternative specifications for each of them (with or without big data), also capable of handling the frequency mismatch in our data. Specifically, we consider: Naive and autoregressive (AR) models as benchmarks Dynamic Factor Analysis (DFA) Partial Least Squares (PLS) Bayesian Regression (BR) and LASSO regression. DFA and PLS are representatives of data reduction methods; BR and LASSO are representatives of, respectively, econometric and machine learning techniques. In addition, we also consider model averaging. Overall, we have a total of 255 models and model combinations. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

59 Empirical Analysis: Introduction In general, we find that factors extracted using a combination of standard macroeconomic and Big Data predictors lead to a substantial improvement in the nowcasting and forecasting performance for all variables across the three economies, though the gains are small for unemployment, which is a very persistent variable over our evaluation sample which makes beating a simple AR model difficult, in particular at long horizons. Furthermore, a data-driven automated model selection strategy, where the forecasts from a set of best performing models over the recent past are pooled, performs particularly well, with Big Data present in about 45% of the pooled models. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

60 Empirical Analysis: Conclusions In general, we find the DFA and BR often provide accurate results with and without Google Trends. Most of the suggested models and model averages perform better than the benchmark when nowcasting/forecasting the CPI and RTI, however they fail using the unemployment rate. This is due to structure of the unemployment rate series which is slowly moving in our evaluation months. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

61 Empirical Analysis: Conclusions A longer evaluation period could reverse this finding, and provide more reliable results, but unfortunately we are constrained by the availability of Google Trends, which start in Our suggested data-driven automated strategy, which rotates models based on their forecasting performance in the past 1, 6 and 12 months, seems to work extremely well for CPI and RTI, and decently for unemployment, and therefore it provides a powerful tool for applied forecasting. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

62 Empirical Analysis: Conclusions The detailed analysis of model rotation shows that, on average, 45% of the times the chosen best models for averaging, for the CPI and RTI target variables across all three economies, include Google Trends. Hence, we conclude that Big Data, as proxied via Google Trends in our applications and combined with standard macroeconomic and financial indicators, can indeed improve the nowcasting and short term forecasting performance of econometric models. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

63 Conclusions Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

64 Overall Recommendations Overall, our suggestion is to take a pragmatic approach that balances potential gains and costs from the use of Big Data for nowcasting macroeconomic indicators, in addition to standard indicators. A preliminary step should be an a priori assessment of the potential usefulness of Big Data for a specific indicator of interest, such as GDP growth, inflation or unemployment. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

65 Overall Recommendations This requires to evaluate the quality of the existing nowcasts and whether any identified problems, such as bias or inefficiency or large errors in specific periods, can be fixed by adding information as potentially available in Big Data based indicators. Similarly, it should be considered whether these additional indicators could improve the timeliness, frequency of release and extent of revision of the nowcasts. Relevant information can be gathered by looking at existing empirical studies focusing on similar variables or countries, and in this respect the extensive literature review we presented can be quite helpful. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

66 Overall Recommendations Once Big Data passes the need check in the preliminary step, the first proper step of the Big Data based nowcasting exercise is a careful search for the specific Big Data to be collected. As we have seen, there are many potential providers, which can be grouped into Social Networks, Traditional Business Systems, and the Internet of Things. Naturally, it is not possible to give general guidelines on a preferred data source, as its choice is heavily dependent on the target indicator of the nowcasting exercise. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

67 Overall Recommendations Having identified the preferred source of Big Data, the second step requires to assess the availability and quality of the data. A relevant issue is whether direct data collection is needed, which can be very costly, or a provider makes the data available. In case a provider is available, its reliability (and cost) should be assessed, together with the availability of meta data, the likelihood that continuity of data provision is guaranteed, and the possibility of customization (e.g., make the data available at higher frequency, with a particular disaggregation, for a longer sample, etc.). Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

68 Overall Recommendations All these aspects are particularly relevant in the context of applications in official statistical offices. As the specific goal is nowcasting, it should be also carefully checked that the temporal dimension of the Big Data is long and homogeneous enough to allow for proper model estimation and evaluation of the resulting nowcasts. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

69 Overall Recommendations The third step analyzes specific features of the collected Big Data. A first issue that is sometimes neglected is the amount of the required storage space and the associated need of specific hardware and software for storing and handling the Big Data. A second issue is the type of the Big Data, as it is often unstructured and may require a transformation into cross-sectional or time series observations. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

70 Overall Recommendations Even when already available in numerical format, pre-treatment of the Big Data is often needed to remove deterministic patterns and deal with data irregularities, such as outliers and missing observations. While standard methods can be usually applied, the size of the datasets suggests to resort to robust and computationally simple approaches, applied variable by variable. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

71 Overall Recommendations The fourth step requires to assess the presence of a possible bias in the answers provided by the Big Data, due to the digital divide or the tendency of individuals and businesses not to report truthfully their experiences, assessments and opinions. A related problem, particularly relevant for nowcasting, is the possible instability of the relationship with the target variable. This is a common problem also with standard indicators, as the type and size of economic shocks that hit the economy vary over time. Both issues can be however tackled at the modelling and evaluation stages. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

72 Overall Recommendations The fifth step when nowcasting with Big Data requires to select the proper econometric technique. Here, it is important to be systematic about the correspondence between the nature of the Big Data setting and use under investigation and the method that is used. There is a number of dimensions along which we wish to differentiate. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

73 Overall Recommendations 1 The first choice is between the use of methods suited for large but not huge datasets, and therefore applied to summaries of the Big Data (such as Google Trends, commonly used in nowcasting applications), or of techniques specifically designed for Big Data. For example, nowcasting with large datasets can be based on factor models, large BVARs, or shrinkage regressions. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

74 Overall Recommendations 1 Huge datasets can be handled by sparse principal components, linear models combined with heuristic optimization,or a variety of machine learning methods (which, though, are generally developed assuming i.i.d. variables). It is difficult to provide an a priori ranking of all these techniques and there are few empirical comparisons and even fewer in a nowcasting context, so that is may be appropriate to apply and compare a few of them for nowcasting the specific indicator of interest. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

75 Overall Recommendations 2 A second dimension is the frequency of the available data. If this frequency is mixed then specific techniques for mixed frequency data become relevant. Chief among them is unrestricted MIDAS which provides a very flexible framework of analysis and can be adapted to work together with most if not all Big Data methods be they machine learning of econometric. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

76 Overall Recommendations 3 Yet another dimension relates to the purpose for which large datasets are considered. Possibilities include model or indicator selection, forecasting or a more structural analysis. In this case of course each purpose is best served by different methods and the choice of method crucially depends on the purpose. Most methods can be used for forecasting and so the choice has to be case dependent. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

77 Bayesian VAR models stand out as an appropriate method in this context. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79 Overall Recommendations 3 We recommend that as many methods are possible are evaluated in a forecasting context although past experience suggests that factor analysis and shrinkage methods can be of great use. For model or indicator selection penalised regression and the MT methods seem to be appropriate and also have been reported to have good potential. Finally, for more structural analysis it is clear that it is likely that huge datasets are more difficult to accommodate. In this case, system methods that analyse the whole or a large proportion of the available data simultaneously, seem necessary for a satisfactory analytical outcome.

78 Overall Recommendations The final step consists of a critical and comprehensive assessment of the contribution of Big Data for nowcasting the indicator of interest. In order to avoid, or at least reduce the extent of, data and model snooping, a cross-validation approach should be followed, whereby various models and indicators are estimated over a first sample and they are selected and/or pooled according to their performance, but then the performance of the preferred approaches is re-evaluated over a second sample. This procedure, which we have implemented in the empirical evaluation, provides a reliable assessment of the gains in terms of enhanced nowcasting performance from the use of Big Data. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

79 Overall Recommendations To conclude, we are very confident that Big Data are precious also in a nowcasting context, not only to reduce the errors but also to improve the timeliness, frequency of release and extent of revision. We hope that the approach we have developed in this project will be useful for many users. Kapetanios, Marcellino, Papailias Big Data & Nowcasting April 7th / 79

Warwick Business School Forecasting System. Summary. Ana Galvao, Anthony Garratt and James Mitchell November, 2014

Warwick Business School Forecasting System. Summary. Ana Galvao, Anthony Garratt and James Mitchell November, 2014 Warwick Business School Forecasting System Summary Ana Galvao, Anthony Garratt and James Mitchell November, 21 The main objective of the Warwick Business School Forecasting System is to provide competitive

More information

WP6 Early estimates of economic indicators. WP6 coordinator: Tomaž Špeh, SURS ESSNet Big data: BDES 2018 Sofia 14.,

WP6 Early estimates of economic indicators. WP6 coordinator: Tomaž Špeh, SURS ESSNet Big data: BDES 2018 Sofia 14., WP6 Early estimates of economic indicators WP6 coordinator: Tomaž Špeh, SURS ESSNet Big data: BDES 2018 Sofia 14.,15.5.2018 WP6 objectives Outline Summary of activities carried out and results achieved

More information

NATCOR. Forecast Evaluation. Forecasting with ARIMA models. Nikolaos Kourentzes

NATCOR. Forecast Evaluation. Forecasting with ARIMA models. Nikolaos Kourentzes NATCOR Forecast Evaluation Forecasting with ARIMA models Nikolaos Kourentzes n.kourentzes@lancaster.ac.uk O u t l i n e 1. Bias measures 2. Accuracy measures 3. Evaluation schemes 4. Prediction intervals

More information

Volume 38, Issue 2. Nowcasting the New Turkish GDP

Volume 38, Issue 2. Nowcasting the New Turkish GDP Volume 38, Issue 2 Nowcasting the New Turkish GDP Barış Soybilgen İstanbul Bilgi University Ege Yazgan İstanbul Bilgi University Abstract In this study, we predict year-on-year and quarter-on-quarter Turkish

More information

Introduction to Forecasting

Introduction to Forecasting Introduction to Forecasting Introduction to Forecasting Predicting the future Not an exact science but instead consists of a set of statistical tools and techniques that are supported by human judgment

More information

Macroeconomic nowcasting with big data through the lens of a sparse factor model 1

Macroeconomic nowcasting with big data through the lens of a sparse factor model 1 Macroeconomic nowcasting with big data through the lens of a sparse factor model 1 Laurent Ferrara (Banque de France) Anna Simoni (CREST, CNRS, ENSAE, École Polytechnique ) ECB Forecasting Conference June

More information

Euro-indicators Working Group

Euro-indicators Working Group Euro-indicators Working Group Luxembourg, 9 th & 10 th June 2011 Item 9.3 of the Agenda Towards an early warning system for the Euro area By Gian Luigi Mazzi Doc 308/11 Introduction Clear picture of economic

More information

NOWCASTING THE NEW TURKISH GDP

NOWCASTING THE NEW TURKISH GDP CEFIS WORKING PAPER SERIES First Version: August 2017 NOWCASTING THE NEW TURKISH GDP Barış Soybilgen, İstanbul Bilgi University Ege Yazgan, İstanbul Bilgi University Nowcasting the New Turkish GDP Barış

More information

Advances in promotional modelling and analytics

Advances in promotional modelling and analytics Advances in promotional modelling and analytics High School of Economics St. Petersburg 25 May 2016 Nikolaos Kourentzes n.kourentzes@lancaster.ac.uk O u t l i n e 1. What is forecasting? 2. Forecasting,

More information

ESRI Research Note Nowcasting and the Need for Timely Estimates of Movements in Irish Output

ESRI Research Note Nowcasting and the Need for Timely Estimates of Movements in Irish Output ESRI Research Note Nowcasting and the Need for Timely Estimates of Movements in Irish Output David Byrne, Kieran McQuinn and Ciara Morley Research Notes are short papers on focused research issues. Nowcasting

More information

Comment on: Automated Short-Run Economic Forecast (ASEF) By Nicolas Stoffels. Bank of Canada Workshop October 25-26, 2007

Comment on: Automated Short-Run Economic Forecast (ASEF) By Nicolas Stoffels. Bank of Canada Workshop October 25-26, 2007 Background material Comment on: Automated Short-Run Economic Forecast (ASEF) By Nicolas Stoffels Bank of Canada Workshop October 25-26, 2007 André Binette (Bank of Canada) 1 Summary of ASEF 1. Automated

More information

Assessing the use of Google Trends to predict credit developments 1. Edwige Burdeau* Banque de France, Paris, France

Assessing the use of Google Trends to predict credit developments 1. Edwige Burdeau* Banque de France, Paris, France Assessing the use of Google Trends to predict credit developments 1 Edwige Burdeau* Banque de France, Paris, France edwige.burdeau@banque-france.fr Etienne Kintzler Banque de France, Paris, France etienne.kintzler@banque-france.fr

More information

Assistant Prof. Abed Schokry. Operations and Productions Management. First Semester

Assistant Prof. Abed Schokry. Operations and Productions Management. First Semester Chapter 3 Forecasting Assistant Prof. Abed Schokry Operations and Productions Management First Semester 2010 2011 Chapter 3: Learning Outcomes You should be able to: List the elements of a good forecast

More information

The Econometric Analysis of Mixed Frequency Data with Macro/Finance Applications

The Econometric Analysis of Mixed Frequency Data with Macro/Finance Applications The Econometric Analysis of Mixed Frequency Data with Macro/Finance Applications Instructor: Eric Ghysels Structure of Course It is easy to collect and store large data sets, particularly of financial

More information

International Seminar on Early Warning and Business Cycle Indicators. 14 to 16 December 2009 Scheveningen, The Netherlands

International Seminar on Early Warning and Business Cycle Indicators. 14 to 16 December 2009 Scheveningen, The Netherlands ESA/STAT/AC.202/S2.2 International Seminar on Early Warning and Business Cycle Indicators 14 to 16 December 2009 Scheveningen, The Netherlands Overview of terminology for high frequency indicators Roberto

More information

Chapter 13: Forecasting

Chapter 13: Forecasting Chapter 13: Forecasting Assistant Prof. Abed Schokry Operations and Productions Management First Semester 2013-2014 Chapter 13: Learning Outcomes You should be able to: List the elements of a good forecast

More information

Nowcasting Norwegian GDP

Nowcasting Norwegian GDP Nowcasting Norwegian GDP Knut Are Aastveit and Tørres Trovik May 13, 2007 Introduction Motivation The last decades of advances in information technology has made it possible to access a huge amount of

More information

Machine Learning Linear Regression. Prof. Matteo Matteucci

Machine Learning Linear Regression. Prof. Matteo Matteucci Machine Learning Linear Regression Prof. Matteo Matteucci Outline 2 o Simple Linear Regression Model Least Squares Fit Measures of Fit Inference in Regression o Multi Variate Regession Model Least Squares

More information

Linear regression methods

Linear regression methods Linear regression methods Most of our intuition about statistical methods stem from linear regression. For observations i = 1,..., n, the model is Y i = p X ij β j + ε i, j=1 where Y i is the response

More information

Forecasting with Expert Opinions

Forecasting with Expert Opinions CS 229 Machine Learning Forecasting with Expert Opinions Khalid El-Awady Background In 2003 the Wall Street Journal (WSJ) introduced its Monthly Economic Forecasting Survey. Each month the WSJ polls between

More information

Mixed frequency models with MA components

Mixed frequency models with MA components Mixed frequency models with MA components Claudia Foroni a Massimiliano Marcellino b Dalibor Stevanović c a Deutsche Bundesbank b Bocconi University, IGIER and CEPR c Université du Québec à Montréal September

More information

Are Forecast Updates Progressive?

Are Forecast Updates Progressive? MPRA Munich Personal RePEc Archive Are Forecast Updates Progressive? Chia-Lin Chang and Philip Hans Franses and Michael McAleer National Chung Hsing University, Erasmus University Rotterdam, Erasmus University

More information

Discussion of Predictive Density Combinations with Dynamic Learning for Large Data Sets in Economics and Finance

Discussion of Predictive Density Combinations with Dynamic Learning for Large Data Sets in Economics and Finance Discussion of Predictive Density Combinations with Dynamic Learning for Large Data Sets in Economics and Finance by Casarin, Grassi, Ravazzolo, Herman K. van Dijk Dimitris Korobilis University of Essex,

More information

Using Temporal Hierarchies to Predict Tourism Demand

Using Temporal Hierarchies to Predict Tourism Demand Using Temporal Hierarchies to Predict Tourism Demand International Symposium on Forecasting 29 th June 2015 Nikolaos Kourentzes Lancaster University George Athanasopoulos Monash University n.kourentzes@lancaster.ac.uk

More information

Linear Model Selection and Regularization

Linear Model Selection and Regularization Linear Model Selection and Regularization Recall the linear model Y = β 0 + β 1 X 1 + + β p X p + ɛ. In the lectures that follow, we consider some approaches for extending the linear model framework. In

More information

GDP growth and inflation forecasting performance of Asian Development Outlook

GDP growth and inflation forecasting performance of Asian Development Outlook and inflation forecasting performance of Asian Development Outlook Asian Development Outlook (ADO) has been the flagship publication of the Asian Development Bank (ADB) since 1989. Issued twice a year

More information

FaMIDAS: A Mixed Frequency Factor Model with MIDAS structure

FaMIDAS: A Mixed Frequency Factor Model with MIDAS structure FaMIDAS: A Mixed Frequency Factor Model with MIDAS structure Frale C., Monteforte L. Computational and Financial Econometrics Limassol, October 2009 Introduction After the recent financial and economic

More information

Bayesian Variable Selection for Nowcasting Time Series

Bayesian Variable Selection for Nowcasting Time Series Bayesian Variable Selection for Time Series Steve Scott Hal Varian Google August 14, 2013 What day of the week are there the most searches for [hangover]? 1. Sunday 2. Monday 3. Tuesday 4. Wednesday 5.

More information

Identifying the Monetary Policy Shock Christiano et al. (1999)

Identifying the Monetary Policy Shock Christiano et al. (1999) Identifying the Monetary Policy Shock Christiano et al. (1999) The question we are asking is: What are the consequences of a monetary policy shock a shock which is purely related to monetary conditions

More information

FORECASTING STANDARDS CHECKLIST

FORECASTING STANDARDS CHECKLIST FORECASTING STANDARDS CHECKLIST An electronic version of this checklist is available on the Forecasting Principles Web site. PROBLEM 1. Setting Objectives 1.1. Describe decisions that might be affected

More information

Applied Microeconometrics (L5): Panel Data-Basics

Applied Microeconometrics (L5): Panel Data-Basics Applied Microeconometrics (L5): Panel Data-Basics Nicholas Giannakopoulos University of Patras Department of Economics ngias@upatras.gr November 10, 2015 Nicholas Giannakopoulos (UPatras) MSc Applied Economics

More information

SGA-2: WP6- Early estimates. SURS: Boro Nikić, Tomaž Špeh ESSnet Big Data Project Brussels 26.,

SGA-2: WP6- Early estimates. SURS: Boro Nikić, Tomaž Špeh ESSnet Big Data Project Brussels 26., SGA-2: WP6- Early estimates SURS: Boro Nikić, Tomaž Špeh ESSnet Big Data Project Brussels 26.,27.10. 2017 Outline WP6 state of affairs WP6 pilot: Early estimates of economic indicators Methods Data sources

More information

TESTING FOR CO-INTEGRATION

TESTING FOR CO-INTEGRATION Bo Sjö 2010-12-05 TESTING FOR CO-INTEGRATION To be used in combination with Sjö (2008) Testing for Unit Roots and Cointegration A Guide. Instructions: Use the Johansen method to test for Purchasing Power

More information

UPPSALA UNIVERSITY - DEPARTMENT OF STATISTICS MIDAS. Forecasting quarterly GDP using higherfrequency

UPPSALA UNIVERSITY - DEPARTMENT OF STATISTICS MIDAS. Forecasting quarterly GDP using higherfrequency UPPSALA UNIVERSITY - DEPARTMENT OF STATISTICS MIDAS Forecasting quarterly GDP using higherfrequency data Authors: Hanna Lindgren and Victor Nilsson Supervisor: Lars Forsberg January 12, 2015 We forecast

More information

3. If a forecast is too high when compared to an actual outcome, will that forecast error be positive or negative?

3. If a forecast is too high when compared to an actual outcome, will that forecast error be positive or negative? 1. Does a moving average forecast become more or less responsive to changes in a data series when more data points are included in the average? 2. Does an exponential smoothing forecast become more or

More information

Discussion Papers. Google Searches as a Means of Improving the Nowcasts of Key Macroeconomic Variables

Discussion Papers. Google Searches as a Means of Improving the Nowcasts of Key Macroeconomic Variables Deutsches Institut für Wirtschaftsforschung www.diw.de Discussion Papers 946 Konstantin A. Kholodilin Maximilian Podstawski Boriss Siliverstovs Constantin Bürgi Google Searches as a Means of Improving

More information

No Staff Memo. Evaluating real-time forecasts from Norges Bank s system for averaging models. Anne Sofie Jore, Norges Bank Monetary Policy

No Staff Memo. Evaluating real-time forecasts from Norges Bank s system for averaging models. Anne Sofie Jore, Norges Bank Monetary Policy No. 12 2012 Staff Memo Evaluating real-time forecasts from Norges Bank s system for averaging models Anne Sofie Jore, Norges Bank Monetary Policy Staff Memos present reports and documentation written by

More information

THE LONG-RUN DETERMINANTS OF MONEY DEMAND IN SLOVAKIA MARTIN LUKÁČIK - ADRIANA LUKÁČIKOVÁ - KAROL SZOMOLÁNYI

THE LONG-RUN DETERMINANTS OF MONEY DEMAND IN SLOVAKIA MARTIN LUKÁČIK - ADRIANA LUKÁČIKOVÁ - KAROL SZOMOLÁNYI 92 Multiple Criteria Decision Making XIII THE LONG-RUN DETERMINANTS OF MONEY DEMAND IN SLOVAKIA MARTIN LUKÁČIK - ADRIANA LUKÁČIKOVÁ - KAROL SZOMOLÁNYI Abstract: The paper verifies the long-run determinants

More information

Machine Learning for OR & FE

Machine Learning for OR & FE Machine Learning for OR & FE Regression II: Regularization and Shrinkage Methods Martin Haugh Department of Industrial Engineering and Operations Research Columbia University Email: martin.b.haugh@gmail.com

More information

Forecasting Chapter 3

Forecasting Chapter 3 Forecasting Chapter 3 Introduction Current factors and conditions Past experience in a similar situation 2 Accounting. New product/process cost estimates, profit projections, cash management. Finance.

More information

Are Forecast Updates Progressive?

Are Forecast Updates Progressive? CIRJE-F-736 Are Forecast Updates Progressive? Chia-Lin Chang National Chung Hsing University Philip Hans Franses Erasmus University Rotterdam Michael McAleer Erasmus University Rotterdam and Tinbergen

More information

Introduction to Regression Analysis. Dr. Devlina Chatterjee 11 th August, 2017

Introduction to Regression Analysis. Dr. Devlina Chatterjee 11 th August, 2017 Introduction to Regression Analysis Dr. Devlina Chatterjee 11 th August, 2017 What is regression analysis? Regression analysis is a statistical technique for studying linear relationships. One dependent

More information

EUROPEAN ECONOMY EUROPEAN COMMISSION DIRECTORATE-GENERAL FOR ECONOMIC AND FINANCIAL AFFAIRS

EUROPEAN ECONOMY EUROPEAN COMMISSION DIRECTORATE-GENERAL FOR ECONOMIC AND FINANCIAL AFFAIRS EUROPEAN ECONOMY EUROPEAN COMMISSION DIRECTORATE-GENERAL FOR ECONOMIC AND FINANCIAL AFFAIRS ECONOMIC PAPERS ISSN 1725-3187 http://europa.eu.int/comm/economy_finance N 249 June 2006 The Stacked Leading

More information

WRF Webcast. Improving the Accuracy of Short-Term Water Demand Forecasts

WRF Webcast. Improving the Accuracy of Short-Term Water Demand Forecasts No part of this presentation may be copied, reproduced, or otherwise utilized without permission. WRF Webcast Improving the Accuracy of Short-Term Water Demand Forecasts August 29, 2017 Presenters Maureen

More information

NOWCASTING GDP IN GREECE: A NOTE ON FORECASTING IMPROVEMENTS FROM THE USE OF BRIDGE MODELS

NOWCASTING GDP IN GREECE: A NOTE ON FORECASTING IMPROVEMENTS FROM THE USE OF BRIDGE MODELS South-Eastern Europe Journal of Economics 1 (2015) 85-100 NOWCASTING GDP IN GREECE: A NOTE ON FORECASTING IMPROVEMENTS FROM THE USE OF BRIDGE MODELS DIMITRA LAMPROU * University of Peloponnese, Tripoli,

More information

The prediction of house price

The prediction of house price 000 001 002 003 004 005 006 007 008 009 010 011 012 013 014 015 016 017 018 019 020 021 022 023 024 025 026 027 028 029 030 031 032 033 034 035 036 037 038 039 040 041 042 043 044 045 046 047 048 049 050

More information

STOCKHOLM UNIVERSITY Department of Economics Course name: Empirical Methods Course code: EC40 Examiner: Lena Nekby Number of credits: 7,5 credits Date of exam: Saturday, May 9, 008 Examination time: 3

More information

Forecasting in the presence of recent structural breaks

Forecasting in the presence of recent structural breaks Forecasting in the presence of recent structural breaks Second International Conference in memory of Carlo Giannini Jana Eklund 1, George Kapetanios 1,2 and Simon Price 1,3 1 Bank of England, 2 Queen Mary

More information

Lecture 2: Univariate Time Series

Lecture 2: Univariate Time Series Lecture 2: Univariate Time Series Analysis: Conditional and Unconditional Densities, Stationarity, ARMA Processes Prof. Massimo Guidolin 20192 Financial Econometrics Spring/Winter 2017 Overview Motivation:

More information

Bayesian modelling of real GDP rate in Romania

Bayesian modelling of real GDP rate in Romania Bayesian modelling of real GDP rate in Romania Mihaela SIMIONESCU 1* 1 Romanian Academy, Institute for Economic Forecasting, Bucharest, Calea 13 Septembrie, no. 13, District 5, Bucharest 1 Abstract The

More information

The aggregation problem in its hystorical perspective: a summary overview

The aggregation problem in its hystorical perspective: a summary overview The aggregation problem in its hystorical perspective: a summary overview WYE CITY GROUP Global Conference on Agricultural and Rural Household Statistics By GIANCARLO LUTERO Presented by EDOARDO PIZZOLI

More information

Quarterly Bulletin 2018 Q3. Topical article Gauging the globe: the Bank s approach to nowcasting world GDP. Bank of England 2018 ISSN

Quarterly Bulletin 2018 Q3. Topical article Gauging the globe: the Bank s approach to nowcasting world GDP. Bank of England 2018 ISSN Quarterly Bulletin 2018 Q3 Topical article Gauging the globe: the Bank s approach to nowcasting world GDP Bank of England 2018 ISSN 2399-4568 Topical articles The Bank s approach to nowcasting world GDP

More information

Least angle regression for time series forecasting with many predictors. Sarah Gelper & Christophe Croux Faculty of Business and Economics K.U.

Least angle regression for time series forecasting with many predictors. Sarah Gelper & Christophe Croux Faculty of Business and Economics K.U. Least angle regression for time series forecasting with many predictors Sarah Gelper & Christophe Croux Faculty of Business and Economics K.U.Leuven I ve got all these variables, but I don t know which

More information

10. Time series regression and forecasting

10. Time series regression and forecasting 10. Time series regression and forecasting Key feature of this section: Analysis of data on a single entity observed at multiple points in time (time series data) Typical research questions: What is the

More information

14 Random Variables and Simulation

14 Random Variables and Simulation 14 Random Variables and Simulation In this lecture note we consider the relationship between random variables and simulation models. Random variables play two important roles in simulation models. We assume

More information

Chapter 5: Forecasting

Chapter 5: Forecasting 1 Textbook: pp. 165-202 Chapter 5: Forecasting Every day, managers make decisions without knowing what will happen in the future 2 Learning Objectives After completing this chapter, students will be able

More information

This is the author s final accepted version.

This is the author s final accepted version. Bagdatoglou, G., Kontonikas, A., and Wohar, M. E. (2015) Forecasting US inflation using dynamic general-to-specific model selection. Bulletin of Economic Research, 68(2), pp. 151-167. (doi:10.1111/boer.12041)

More information

Econ 423 Lecture Notes: Additional Topics in Time Series 1

Econ 423 Lecture Notes: Additional Topics in Time Series 1 Econ 423 Lecture Notes: Additional Topics in Time Series 1 John C. Chao April 25, 2017 1 These notes are based in large part on Chapter 16 of Stock and Watson (2011). They are for instructional purposes

More information

Is the Purchasing Managers Index a Reliable Indicator of GDP Growth?

Is the Purchasing Managers Index a Reliable Indicator of GDP Growth? Is the Purchasing Managers Index a Reliable Indicator of GDP Growth? Some Evidence from Indian Data i c r a b u l l e t i n Suchismita Bose Abstract Purchasing Managers Index (PMI) surveys have been developed

More information

Identifying SVARs with Sign Restrictions and Heteroskedasticity

Identifying SVARs with Sign Restrictions and Heteroskedasticity Identifying SVARs with Sign Restrictions and Heteroskedasticity Srečko Zimic VERY PRELIMINARY AND INCOMPLETE NOT FOR DISTRIBUTION February 13, 217 Abstract This paper introduces a new method to identify

More information

The Prediction of Monthly Inflation Rate in Romania 1

The Prediction of Monthly Inflation Rate in Romania 1 Economic Insights Trends and Challenges Vol.III (LXVI) No. 2/2014 75-84 The Prediction of Monthly Inflation Rate in Romania 1 Mihaela Simionescu Institute for Economic Forecasting of the Romanian Academy,

More information

Economics 390 Economic Forecasting

Economics 390 Economic Forecasting Economics 390 Economic Forecasting Prerequisite: Econ 410 or equivalent Course information is on website Office Hours Tuesdays & Thursdays 2:30 3:30 or by appointment Textbooks Forecasting for Economics

More information

International Seminar on Early Warning and Business Cycle Indicators. 14 to 16 December 2009 Scheveningen, The Netherlands

International Seminar on Early Warning and Business Cycle Indicators. 14 to 16 December 2009 Scheveningen, The Netherlands ESA/STAT/AC. 202/S5.3 International Seminar on Early Warning and Business Cycle Indicators 14 to 16 December 2009 Scheveningen, The Netherlands The Euro-area recession and nowcasting GDP growth using statistical

More information

Industrial Engineering Prof. Inderdeep Singh Department of Mechanical & Industrial Engineering Indian Institute of Technology, Roorkee

Industrial Engineering Prof. Inderdeep Singh Department of Mechanical & Industrial Engineering Indian Institute of Technology, Roorkee Industrial Engineering Prof. Inderdeep Singh Department of Mechanical & Industrial Engineering Indian Institute of Technology, Roorkee Module - 04 Lecture - 05 Sales Forecasting - II A very warm welcome

More information

Forecasting Using Time Series Models

Forecasting Using Time Series Models Forecasting Using Time Series Models Dr. J Katyayani 1, M Jahnavi 2 Pothugunta Krishna Prasad 3 1 Professor, Department of MBA, SPMVV, Tirupati, India 2 Assistant Professor, Koshys Institute of Management

More information

NOWCASTING REPORT. Updated: September 23, 2016

NOWCASTING REPORT. Updated: September 23, 2016 NOWCASTING REPORT Updated: September 23, 216 The FRBNY Staff Nowcast stands at 2.3% and 1.2% for 216:Q3 and 216:Q4, respectively. Negative news since the report was last published two weeks ago pushed

More information

NOWCASTING REPORT. Updated: October 21, 2016

NOWCASTING REPORT. Updated: October 21, 2016 NOWCASTING REPORT Updated: October 21, 216 The FRBNY Staff Nowcast stands at 2.2% for 216:Q3 and 1.4% for 216:Q4. Overall this week s news had a negative effect on the nowcast. The most notable developments

More information

NOWCASTING REPORT. Updated: April 15, 2016

NOWCASTING REPORT. Updated: April 15, 2016 NOWCASTING REPORT Updated: April 15, 2016 GDP growth prospects remain moderate for the rst half of the year: the nowcasts stand at 0.8% for 2016:Q1 and 1.2% for 2016:Q2. News from this week's data releases

More information

Nowcasting the Finnish economy with a large Bayesian vector autoregressive model

Nowcasting the Finnish economy with a large Bayesian vector autoregressive model Nowcasting the Finnish economy with a large Bayesian vector autoregressive model Itkonen,J. & Juvonen,P. Suomen Pankki Conference on Real-Time Data Analysis, Methods and Applications 19.10.2017 Introduction

More information

Time Series Analysis. Smoothing Time Series. 2) assessment of/accounting for seasonality. 3) assessment of/exploiting "serial correlation"

Time Series Analysis. Smoothing Time Series. 2) assessment of/accounting for seasonality. 3) assessment of/exploiting serial correlation Time Series Analysis 2) assessment of/accounting for seasonality This (not surprisingly) concerns the analysis of data collected over time... weekly values, monthly values, quarterly values, yearly values,

More information

Lecture 1: OLS derivations and inference

Lecture 1: OLS derivations and inference Lecture 1: OLS derivations and inference Econometric Methods Warsaw School of Economics (1) OLS 1 / 43 Outline 1 Introduction Course information Econometrics: a reminder Preliminary data exploration 2

More information

Lecture 2. Judging the Performance of Classifiers. Nitin R. Patel

Lecture 2. Judging the Performance of Classifiers. Nitin R. Patel Lecture 2 Judging the Performance of Classifiers Nitin R. Patel 1 In this note we will examine the question of how to udge the usefulness of a classifier and how to compare different classifiers. Not only

More information

Making sense of Econometrics: Basics

Making sense of Econometrics: Basics Making sense of Econometrics: Basics Lecture 4: Qualitative influences and Heteroskedasticity Egypt Scholars Economic Society November 1, 2014 Assignment & feedback enter classroom at http://b.socrative.com/login/student/

More information

Introduction to Econometrics

Introduction to Econometrics Introduction to Econometrics STAT-S-301 Introduction to Time Series Regression and Forecasting (2016/2017) Lecturer: Yves Dominicy Teaching Assistant: Elise Petit 1 Introduction to Time Series Regression

More information

How News and Its Context Drive Risk and Returns Around the World

How News and Its Context Drive Risk and Returns Around the World How News and Its Context Drive Risk and Returns Around the World Charles Calomiris and Harry Mamaysky Columbia Business School Q Group Spring 2018 Meeting Outline of talk Introduction Data and text measures

More information

NOWCASTING REPORT. Updated: August 17, 2018

NOWCASTING REPORT. Updated: August 17, 2018 NOWCASTING REPORT Updated: August 17, 2018 The New York Fed Staff Nowcast for 2018:Q3 stands at 2.4%. News from this week s data releases decreased the nowcast for 2018:Q3 by 0.2 percentage point. Negative

More information

Least Absolute Shrinkage is Equivalent to Quadratic Penalization

Least Absolute Shrinkage is Equivalent to Quadratic Penalization Least Absolute Shrinkage is Equivalent to Quadratic Penalization Yves Grandvalet Heudiasyc, UMR CNRS 6599, Université de Technologie de Compiègne, BP 20.529, 60205 Compiègne Cedex, France Yves.Grandvalet@hds.utc.fr

More information

6. Assessing studies based on multiple regression

6. Assessing studies based on multiple regression 6. Assessing studies based on multiple regression Questions of this section: What makes a study using multiple regression (un)reliable? When does multiple regression provide a useful estimate of the causal

More information

The value of competitive information in forecasting FMCG retail product sales and category effects

The value of competitive information in forecasting FMCG retail product sales and category effects The value of competitive information in forecasting FMCG retail product sales and category effects Professor Robert Fildes r.fildes@lancaster.ac.uk Dr Tao Huang t.huang@lancaster.ac.uk Dr Didier Soopramanien

More information

PhD/MA Econometrics Examination January 2012 PART A

PhD/MA Econometrics Examination January 2012 PART A PhD/MA Econometrics Examination January 2012 PART A ANSWER ANY TWO QUESTIONS IN THIS SECTION NOTE: (1) The indicator function has the properties: (2) Question 1 Let, [defined as if using the indicator

More information

An economic application of machine learning: Nowcasting Thai exports using global financial market data and time-lag lasso

An economic application of machine learning: Nowcasting Thai exports using global financial market data and time-lag lasso An economic application of machine learning: Nowcasting Thai exports using global financial market data and time-lag lasso PIER Exchange Nov. 17, 2016 Thammarak Moenjak What is machine learning? Wikipedia

More information

Mining Big Data Using Parsimonious Factor and Shrinkage Methods

Mining Big Data Using Parsimonious Factor and Shrinkage Methods Mining Big Data Using Parsimonious Factor and Shrinkage Methods Hyun Hak Kim 1 and Norman Swanson 2 1 Bank of Korea and 2 Rutgers University ECB Workshop on using Big Data for Forecasting and Statistics

More information

A Course on Advanced Econometrics

A Course on Advanced Econometrics A Course on Advanced Econometrics Yongmiao Hong The Ernest S. Liu Professor of Economics & International Studies Cornell University Course Introduction: Modern economies are full of uncertainties and risk.

More information

ECON3327: Financial Econometrics, Spring 2016

ECON3327: Financial Econometrics, Spring 2016 ECON3327: Financial Econometrics, Spring 2016 Wooldridge, Introductory Econometrics (5th ed, 2012) Chapter 11: OLS with time series data Stationary and weakly dependent time series The notion of a stationary

More information

INSTITUTE AND FACULTY OF ACTUARIES. Curriculum 2019 SPECIMEN SOLUTIONS

INSTITUTE AND FACULTY OF ACTUARIES. Curriculum 2019 SPECIMEN SOLUTIONS INSTITUTE AND FACULTY OF ACTUARIES Curriculum 09 SPECIMEN SOLUTIONS Subject CSA Risk Modelling and Survival Analysis Institute and Faculty of Actuaries Sample path A continuous time, discrete state process

More information

Causal Inference with Big Data Sets

Causal Inference with Big Data Sets Causal Inference with Big Data Sets Marcelo Coca Perraillon University of Colorado AMC November 2016 1 / 1 Outlone Outline Big data Causal inference in economics and statistics Regression discontinuity

More information

Euro-indicators Working Group

Euro-indicators Working Group Euro-indicators Working Group Luxembourg, 9 th & 10 th June 2011 Item 9.4 of the Agenda New developments in EuroMIND estimates Rosa Ruggeri Cannata Doc 309/11 What is EuroMIND? EuroMIND is a Monthly INDicator

More information

Discussion of Bootstrap prediction intervals for linear, nonlinear, and nonparametric autoregressions, by Li Pan and Dimitris Politis

Discussion of Bootstrap prediction intervals for linear, nonlinear, and nonparametric autoregressions, by Li Pan and Dimitris Politis Discussion of Bootstrap prediction intervals for linear, nonlinear, and nonparametric autoregressions, by Li Pan and Dimitris Politis Sílvia Gonçalves and Benoit Perron Département de sciences économiques,

More information

EC4051 Project and Introductory Econometrics

EC4051 Project and Introductory Econometrics EC4051 Project and Introductory Econometrics Dudley Cooke Trinity College Dublin Dudley Cooke (Trinity College Dublin) Intro to Econometrics 1 / 23 Project Guidelines Each student is required to undertake

More information

Forecasting Practice: Decision Support System to Assist Judgmental Forecasting

Forecasting Practice: Decision Support System to Assist Judgmental Forecasting Forecasting Practice: Decision Support System to Assist Judgmental Forecasting Gauresh Rajadhyaksha Dept. of Electrical and Computer Engg. University of Texas at Austin Austin, TX 78712 Email: gaureshr@mail.utexas.edu

More information

MS-C1620 Statistical inference

MS-C1620 Statistical inference MS-C1620 Statistical inference 10 Linear regression III Joni Virta Department of Mathematics and Systems Analysis School of Science Aalto University Academic year 2018 2019 Period III - IV 1 / 32 Contents

More information

Chapter 8 - Forecasting

Chapter 8 - Forecasting Chapter 8 - Forecasting Operations Management by R. Dan Reid & Nada R. Sanders 4th Edition Wiley 2010 Wiley 2010 1 Learning Objectives Identify Principles of Forecasting Explain the steps in the forecasting

More information

BUSI 460 Suggested Answers to Selected Review and Discussion Questions Lesson 7

BUSI 460 Suggested Answers to Selected Review and Discussion Questions Lesson 7 BUSI 460 Suggested Answers to Selected Review and Discussion Questions Lesson 7 1. The definitions follow: (a) Time series: Time series data, also known as a data series, consists of observations on a

More information

NOWCASTING REPORT. Updated: September 7, 2018

NOWCASTING REPORT. Updated: September 7, 2018 NOWCASTING REPORT Updated: September 7, 2018 The New York Fed Staff Nowcast stands at 2.2% for 2018:Q3 and 2.8% for 2018:Q4. News from this week s data releases increased the nowcast for 2018:Q3 by 0.2

More information

Assessing recent external forecasts

Assessing recent external forecasts Assessing recent external forecasts Felipe Labbé and Hamish Pepper This article compares the performance between external forecasts and Reserve Bank of New Zealand published projections for real GDP growth,

More information

Regression of Time Series

Regression of Time Series Mahlerʼs Guide to Regression of Time Series CAS Exam S prepared by Howard C. Mahler, FCAS Copyright 2016 by Howard C. Mahler. Study Aid 2016F-S-9Supplement Howard Mahler hmahler@mac.com www.howardmahler.com/teaching

More information

Forecasting of ATM cash demand

Forecasting of ATM cash demand Forecasting of ATM cash demand Ayush Maheshwari Mathematics and scientific computing Department of Mathematics and Statistics Indian Institute of Technology (IIT), Kanpur ayushiitk@gmail.com Project guide:

More information

Nowcasting gross domestic product in Japan using professional forecasters information

Nowcasting gross domestic product in Japan using professional forecasters information Kanagawa University Economic Society Discussion Paper No. 2017-4 Nowcasting gross domestic product in Japan using professional forecasters information Nobuo Iizuka March 9, 2018 Nowcasting gross domestic

More information

How to shape future met-services: a seamless perspective

How to shape future met-services: a seamless perspective How to shape future met-services: a seamless perspective Paolo Ruti, Chief World Weather Research Division Sarah Jones, Chair Scientific Steering Committee Improving the skill big resources ECMWF s forecast

More information

NOWCASTING REPORT. Updated: May 5, 2017

NOWCASTING REPORT. Updated: May 5, 2017 NOWCASTING REPORT Updated: May 5, 217 The FRBNY Staff Nowcast stands at 1.8% for 217:Q2. News from this week s data releases reduced the nowcast for Q2 by percentage point. Negative surprises from the

More information