Self-Adaptive Workload Classification and Forecasting for Proactive Resource Provisioning

Similar documents
DEPARTMENT OF ECONOMETRICS AND BUSINESS STATISTICS

Automatic forecasting with a modified exponential smoothing state space framework

The value of feedback in forecasting competitions

Forecasting: Methods and Applications

Intermittent state-space model for demand forecasting

FORECASTING. Methods and Applications. Third Edition. Spyros Makridakis. European Institute of Business Administration (INSEAD) Steven C Wheelwright

Forecasting with R A practical workshop

Nonparametric time series forecasting with dynamic updating

Local linear forecasts using cubic smoothing splines

Do we need Experts for Time Series Forecasting?

Department of Econometrics and Business Statistics

Modified Holt s Linear Trend Method

at least 50 and preferably 100 observations should be available to build a proper model

Statistical Methods for Forecasting

Using Temporal Hierarchies to Predict Tourism Demand

Complex exponential Smoothing. Ivan Svetunkov Nikolaos Kourentzes Robert Fildes

Forecasting. Dr. Richard Jerz rjerz.com

The SAB Medium Term Sales Forecasting System : From Data to Planning Information. Kenneth Carden SAB : Beer Division Planning

USING DISCRETE EVENT SIMULATION TO EVALUATE TIME SERIES FORECASTING METHODS FOR SECURITY APPLICATIONS

Visualization of distance measures implied by forecast evaluation criteria

How to deal with uncertainties and dynamicity?

Time-Series analysis for wind speed forecasting

The Applicability of Adaptive Control Theory to QoS Design: Limitations and Solutions

A time series is called strictly stationary if the joint distribution of every collection (Y t

FORECASTING METHODS AND APPLICATIONS SPYROS MAKRIDAKIS STEVEN С WHEELWRIGHT. European Institute of Business Administration. Harvard Business School

An approach to make statistical forecasting of products with stationary/seasonal patterns

I N T R O D U C T I O N : G R O W I N G I T C O M P L E X I T Y

Available online at ScienceDirect. Procedia Computer Science 35 (2014 )

FORECASTING TIME SERIES WITH BOOT.EXPOS PROCEDURE

Elements of Multivariate Time Series Analysis

Time Series and Forecasting

Requests Prediction in Cloud with a Cyclic Window Learning Algorithm

Time Series and Forecasting

TIME SERIES ANALYSIS. Forecasting and Control. Wiley. Fifth Edition GWILYM M. JENKINS GEORGE E. P. BOX GREGORY C. REINSEL GRETA M.

Transformations for variance stabilization

Operations Management

Chart types and when to use them

Exponential smoothing in the telecommunications data

EXPONENTIAL SMOOTHING MODELING AND FORECASTING FOR INCIDENCE OF TUBERCULOSIS IN INDIA

Short-term electricity demand forecasting in the time domain and in the frequency domain

A Procedure for Identification of Appropriate State Space and ARIMA Models Based on Time-Series Cross-Validation

Time Series: Theory and Methods

Time Series Analysis -- An Introduction -- AMS 586

Observed cloud anomalies associated with the North Atlantic Oscillation and their radiative feedback

FORECASTING AND MODEL SELECTION

NATCOR. Forecast Evaluation. Forecasting with ARIMA models. Nikolaos Kourentzes

th Hawaii International Conference on System Sciences

Chapter 13: Forecasting

Forecasting Data Streams: Next Generation Flow Field Forecasting

Functional time series

Sample Exam Questions for Econometrics

LOAD FORECASTING APPLICATIONS for THE ENERGY SECTOR

FORECASTING THE INVENTORY LEVEL OF MAGNETIC CARDS IN TOLLING SYSTEM

Ch. 12: Workload Forecasting

Forecasting Gold Price. A Comparative Study

A Comparison of the Forecast Performance of. Double Seasonal ARIMA and Double Seasonal. ARFIMA Models of Electricity Load Demand

A state space model for exponential smoothing with group seasonality

Statistical modelling in climate science

Automatic Forecasting

Components for Accurate Forecasting & Continuous Forecast Improvement

Lecture 1: Introduction to Forecasting

MGR-815. Notes for the MGR-815 course. 12 June School of Superior Technology. Professor Zbigniew Dziong

Chapter-1 Introduction

Forecasting Casino Gaming Traffic with a Data Mining Alternative to Croston s Method

Nonlinear Characterization of Activity Dynamics in Online Collaboration Websites

Estimation Techniques for Monitoring and Controlling the Performance of the Computer Communication Networks

A State Space Framework For Automatic Forecasting Using Exponential Smoothing Methods

Contents LIST OF TABLES... LIST OF FIGURES... xvii. LIST OF LISTINGS... xxi PREFACE. ...xxiii

Scheduling I. Today. Next Time. ! Introduction to scheduling! Classical algorithms. ! Advanced topics on scheduling

Dynamic Service Placement in Geographically Distributed Clouds

Multivariate forecasting with VAR models

Migration-based Detection and Location of the Seismicity Induced at Rittershoffen Geothermal Field (Alsace, France)

arxiv: v1 [cs.lg] 5 Dec 2018

Part I State space models

Another look at measures of forecast accuracy

An Experimental Evaluation of Passage-Based Process Discovery

Lecture 13. Real-Time Scheduling. Daniel Kästner AbsInt GmbH 2013

Forecasting using robust exponential smoothing with damped trend and seasonal components

Prediction Experience and New Model

Day-ahead time series forecasting: application to capacity planning

Ex-Ante Forecast Model Performance with Rolling Simulations

Tuan V. Dinh, Lachlan Andrew and Philip Branch

Graceway Publishing Company, Inc.

Che-Wei Chang Department of Computer Science and Information Engineering, Chang Gung University

Prashant Pant 1, Achal Garg 2 1,2 Engineer, Keppel Offshore and Marine Engineering India Pvt. Ltd, Mumbai. IJRASET 2013: All Rights are Reserved 356

Revenue Maximization in a Cloud Federation

CPU scheduling. CPU Scheduling

Meta-learning how to forecast time series

State-space Model. Eduardo Rossi University of Pavia. November Rossi State-space Model Fin. Econometrics / 53

SOFTWARE ARCHITECTURE DESIGN OF GIS WEB SERVICE AGGREGATION BASED ON SERVICE GROUP

Complex Exponential Smoothing

Cyclical Effect, and Measuring Irregular Effect

Forecasting Egyptian GDP Using ARIMA Models

APPLIED TIME SERIES ECONOMETRICS

TIME SERIES DATA PREDICTION OF NATURAL GAS CONSUMPTION USING ARIMA MODEL

Forecasting: Principles and Practice. Rob J Hyndman. 1. Introduction to forecasting OTexts.org/fpp/1/ OTexts.org/fpp/2/3

A Measurement-Analytic Approach for QoS Estimation in a Network Based on the Dominant Time Scale

Integrated Electricity Demand and Price Forecasting

Analytic Performance Modeling and Optimization of Live VM Migration

Old dog, new tricks: a modelling view of simple moving averages

Transcription:

Self-Adaptive Workload Classification and Forecasting for Proactive Resource Provisioning Nikolas R. Herbst, Nikolaus Huber, Samuel Kounev, Erich Amrehn (IBM R&D) SOFTWARE DESIGN AND QUALITY GROUP INSTITUTE FOR PROGRAM STRUCTURES AND DATA ORGANIZATION, FACULTY OF INFORMATICS ICPE 2013, Prague KIT University of the State of Baden-Wuerttemberg and National Research Center of the Helmholtz Association www.kit.edu

workload intensity Motivation history now near future

workload intensity Motivation history now near future

workload intensity Motivation history now near future

workload intensity Motivation history now near future

workload intensity Motivation history now near future Forecast approaches of the time series analysis:

workload intensity Motivation history now near future Forecast approaches of the time series analysis: Idea: Intelligent and dynamic use of different tools out of the toolkit Goal: Providing information on most likely future developments Knowing about a problem before running into it

Workload Classification & Forecasting (WCF) Self-adaptive Mechanism Overhead Limitation Forecast Frequency Forecast Objectives Forecast Horizon Confidence Level

Workload Classification & Forecasting (WCF) Self-adaptive Mechanism Overhead Limitation Forecast Frequency Forecast Objectives Forecast Horizon Confidence Level Forecast Strategy Overhead Groups INITIAL Naïve, Smoothing, Random Walk FAST Trend Interpolation COMPLEX Decomposition & Seasonal Patterns

Workload Classification & Forecasting (WCF) Self-adaptive Mechanism Overhead Limitation Forecast Objectives Forecast Frequency Forecast Objectives Workload Intensity Trace Forecast Horizon Confidence Level (I) Select Strategies Forecast Strategy Overhead Groups INITIAL Naïve, Smoothing, Random Walk FAST Trend Interpolation COMPLEX Decomposition & Seasonal Patterns

Workload Classification & Forecasting (WCF) Self-adaptive Mechanism Overhead Limitation Forecast Objectives Forecast Frequency Forecast Objectives Workload Intensity Trace Forecast Horizon Confidence Level Accuracy Feedback (I) Select Strategies (II) Forecast & Evaluate Accuracy Forecast Strategy Overhead Groups INITIAL Naïve, Smoothing, Random Walk FAST Trend Interpolation COMPLEX Decomposition & Seasonal Patterns

Time Series Analysis [BFAST]

Time Series Analysis [BFAST]

Time Series Analysis [BFAST]

Time Series Analysis [BFAST]

Forecasting Strategies Survey Basic Methods Naïve, Moving Averages, Random Walk (initial)

Forecasting Strategies Survey Basic Methods Naïve, Moving Averages, Random Walk Trend Interpolation Simple Exponential Smoothing (SES) Cubic Smoothing Splines Croston s method for intermittent time series Autoregressive Moving Averages (ARMA11) (initial) (fast) [Hynd08] [Hynd02] [Shen05] [Box08]

Forecasting Strategies Survey Basic Methods Naïve, Moving Averages, Random Walk Trend Interpolation Simple Exponential Smoothing (SES) Cubic Smoothing Splines Croston s method for intermittent time series Autoregressive Moving Averages (ARMA11) Estimation and Modelling of Seasonal Pattern Extended Exponential Smoothing (ETS) ARIMA framework with automatic model selection tbats for complex seasonal patterns (initial) (fast) [Hynd08] [Hynd02] [Shen05] [Box08] (complex) [Hynd08, Hyn08] [Box08, Hynd08] [Live11]

values Forecast Accuracy Metric Mean absolute scaled error (MASE) [Hynd06] e 3 Observation Forecast 0 1 2 n time[t] e t = forecastvalue t observedvalue t n b n = 1 n observedvalue i observedvalue i 1 i=1 mase 0; n = mean t={1;n} ( e t b ) n

values values Forecast Accuracy Metric Mean absolute scaled error (MASE) [Hynd06] e 3 Observation Forecast Observation Naive Forecast Forecast Reference 0 1 2 n time[t] 0 1 2 n time[t] e t = forecastvalue t observedvalue t n b n = 1 n observedvalue i observedvalue i 1 i=1 mase 0; n = mean t={1;n} ( e t b ) n

Workload Intensity Characterization High level data analysis to gain information on: Noise level & occurrences of unpredictable bursts Influence of trends and seasonal patterns Workload Intensity Trace: Time Series of Request Arrival Rates Data Length Positivity Frequency Attributes Gradient Monotonicity Variance Quartile Dispersion Burstiness Metrics

Decision Tree for Classification

Experiment: Example for Forecast Accuracy Improvement Real-world workload intensity trace: IBM CICS transactions on System z mainframe Comparison of WCF approach to ETS and Naïve forecast

WCF ETS Naive Experiment WCF ETS Naïve percentage error

Case Study: Example for Using Forecast Results Scenario: Additional server instances at certain thresholds, 3 weeks Real-world workload intensity trace (Wikipedia DE page requests per hour)

Case Study Resource provisioning: (I) Without forecasting (solely reactive): Resource provisioning actions triggered by 76 SLA violations (II) Interpreting WCF forecast results (add. proactive): Reduction to 34 or less SLA violations No significant change in resource usage observed (server instances per hour)

Related Work Time Series Analysis [Box08, Hynd08 Shum11] Workload & Performance Monitoring e.g. [vhoo12] Forecasting for Proactive Resource Provisioning [Grun12, Hedw10, Gmac08, Mena04, Bens07] Self-Adaptive Resource Provisioning e.g. [Hube11, Koun10] Focus on forecasting of performance metrics Focus on single tools of the toolkit or other toolkits not workload intensity no dynamic selection

Summary & Outlook Survey on Forecast Approaches Implementation of the WCF-System provides continuous forecast results at run-time Forecast Accuracy Improvement: > 37 % compared to ETS as an established approach Proactive Resource Provisioning enabled: > Up to 75 % less SLA violations than reactive Construction of a Workload Classification Scheme Experiments and Case Study: Evaluation based on real-world workload intensity traces Future Work: > System Integration with Monitoring > Filters: Objective Selection, Splitting > Use for Anomaly Detection [Biel12]

Literature [Bens07] [BFAST] [Biel12] [Box08] M. Bensch, D. Brugger, W. Rosenstiel, M. Bogdan, W. G. Spruth, and P. Baeuerle, Selflearning prediction system for optimization of workload management in a mainframe operating system," in ICEIS 2007 R Package: BFAST, Breaks for additive Season and Trend, http://bfast.r-forge.r-project.org/ T. C. Bielefeld, Online Performance Anomaly Detection for Large-Scale Software Systems, March 2012, Diploma Thesis, University of Kiel G. E. P. Box, G. M. Jenkins, and G. C. Reinsel, Time series analysis : forecasting and control, 2008 [Gmac07] D. Gmach, J. Rolia, L. Cherkasova, and A. Kemper, Workload analysis and demand prediction of enterprise data center applications, IISWC '07 [Grun12] A. Amin, A. Colman, and L. Grunske, An Approach to Forecasting QoS Attributes of Web Services Based on ARIMA and GARCH Models, ICWS 2012 [Hedw10] M. Hedwig, S. Malkowski, C. Bodenstein, and D. Neumann, Towards autonomic cost-aware allocation of cloud resources, ICIS 2010 [Hube11] [Hynd02] N. Huber, F. Brosig, and S. Kounev, Model-based Self-Adaptive Resource Allocation in Virtualized Environments," SEAMS 2011 R. J. Hyndman, M. L. King, I. Pitrun, and B. Billah, Local linear forecasts using cubic smoothing splines, Monash University, Department of Econometrics and Business Statistics, 2002

Literature [Hynd06] [Hynd08] [Hyn08] [Koun10] [Live11] [vhoo12] R. J. Hyndman and A. B. Koehler, Another look at measures of forecast accuracy, International Journal of Forecasting R. J. Hyndman and Y. Khandakar, Automatic time series forecasting: The forecast package for R 2008 R. J. Hyndman, Koehler, Forecasting with Exponential Smoothing : The State Space Approach, Springer Series in Statistics, Berlin, 2008 S. Kounev, F. Brosig, N. Huber, and R. Reussner, Towards self-aware performance and resource management in modern service-oriented systems, SCC '10 A. M. De Livera, R. J. Hyndman, and R. D. Snyder, Forecasting time series with complex seasonal patterns using exponential smoothing, Journal of the American Statistical Association, vol. 106, no. 496, pp.1513, 2011 A. van Hoorn, J. Waller, and W. Hasselbring, Kieker: A framework for application performance monitoring and dynamic software analysis," ICPE 2012 [Mena04] M. N. Bennani and D. A. Menasce, Assessing the robustness of self-managing computer systems under highly variable workloads, 2004 [Shen05] L. Shenstone and R. J. Hyndman, Stochastic models underlying Croston's method for intermittent demand forecasting," Journal of Forecasting, vol. 24, no. 6, pp. 389, 2005 [Shum11] R. H. Shumway, Time Series Analysis and Its Applications: With R Examples, Springer

Backup: Experiment WCF4 vs. tbats, ARIMA WCF limited to choose from tbats and ARIMA Significant accuracy improvement by combination and dynamic strategy selection

Backup: Process of Classification & Forecasting classification: (I) select strategies classification: (II) evaluate selection new classification: (I) select strategies forecast execution & result output forecast execution & result output forecast execution & result output forecast execution & result output forecast horizon/ frequency (need not to be equal) new forecast horizon classification frequency (# forecast executions)

Backup: Architecture and Implementation

Backup: Sequence Diagram

Backup: Class Diagram - Management Nikolas R. Herbst Workload Classification and Forecasting ICPE 2013,

Backup: Class Diagram - Classification

Backup: Class Diagram Forecasting

Backup: Forecast Objectives

Backup: Forecast Strategy Overhead Groups