A stochastic model-based approach to online event prediction and response scheduling

Similar documents
A stochastic model-based approach to online event prediction and response scheduling

A framework for simulation and symbolic state space analysis of non-markovian models

Probabilistic Deadline Miss Analysis of Real-Time Systems Using Regenerative Transient Analysis

An inspection-based compositional approach to the quantitative evaluation of assembly lines

Exploiting non-deterministic analysis in the integration of transient solution techniques for Markov Regenerative Processes

Tennis player segmentation for semantic behavior analysis

Stochastic Renewal Processes in Structural Reliability Analysis:

Lecturer: Olga Galinina

f X (x) = λe λx, , x 0, k 0, λ > 0 Γ (k) f X (u)f X (z u)du

Anomaly Detection for the CERN Large Hadron Collider injection magnets

Smart Home Health Analytics Information Systems University of Maryland Baltimore County

Lecture 4 Discriminant Analysis, k-nearest Neighbors

ring structure Abstract Optical Grid networks allow many computing sites to share their resources by connecting

Markov localization uses an explicit, discrete representation for the probability of all position in the state space.

Mining State Dependencies Between Multiple Sensor Data Sources

DeepLog: Anomaly Detection and Diagnosis from System Logs through Deep Learning. Min Du, Feifei Li, Guineng Zheng, Vivek Srikumar University of Utah

Environment (E) IBP IBP IBP 2 N 2 N. server. System (S) Adapter (A) ACV

Complexity of stochastic branch and bound methods for belief tree search in Bayesian reinforcement learning

Abstract. 1 Introduction. Cointerpretation of Flow Rate-Pressure-Temperature Data from Permanent Downhole Gauges. Deconvolution. Breakpoint detection

CSL302/612 Artificial Intelligence End-Semester Exam 120 Minutes

A Probabilistic Relational Model for Characterizing Situations in Dynamic Multi-Agent Systems

Markov Model. Model representing the different resident states of a system, and the transitions between the different states

Quantitative evaluation of concurrent systems with non-markovian temporal parameters

5. Introduction to Euclid s Geometry

Logic, Knowledge Representation and Bayesian Decision Theory

A Hybrid Approach to Recognising Activities of Daily Living from Object Use in the Home Environment

Contents Preface The Exponential Distribution and the Poisson Process Introduction to Renewal Theory

Semi-Markovian User State Estimation and Policy Optimization for Energy Efficient Mobile Sensing

ONR Mine Warfare Autonomy Virtual Program Review September 7, 2017

Solving Zero-Sum Extensive-Form Games. Branislav Bošanský AE4M36MAS, Fall 2013, Lecture 6

Final Exam December 12, 2017

Neural Networks. Bishop PRML Ch. 5. Alireza Ghane. Feed-forward Networks Network Training Error Backpropagation Applications

Process Scheduling for RTS. RTS Scheduling Approach. Cyclic Executive Approach

Integrated Electricity Demand and Price Forecasting

Modeling Data with Linear Combinations of Basis Functions. Read Chapter 3 in the text by Bishop

Activity Mining in Sensor Networks

Class 11 Non-Parametric Models of a Service System; GI/GI/1, GI/GI/n: Exact & Approximate Analysis.

Multistate Modeling and Applications

Spatial Transformation

As Soon As Probable. O. Maler, J.-F. Kempf, M. Bozga. March 15, VERIMAG Grenoble, France

Forecasting Wind Ramps

UNIT Define joint distribution and joint probability density function for the two random variables X and Y.

A Bayesian Perspective on Residential Demand Response Using Smart Meter Data

27 : Distributed Monte Carlo Markov Chain. 1 Recap of MCMC and Naive Parallel Gibbs Sampling

Final Exam December 12, 2017

COMS 4771 Lecture Boosting 1 / 16

Feed-forward Networks Network Training Error Backpropagation Applications. Neural Networks. Oliver Schulte - CMPT 726. Bishop PRML Ch.

Introduction to Restricted Boltzmann Machines

6 Reinforcement Learning

CptS 570 Machine Learning School of EECS Washington State University. CptS Machine Learning 1

A DARK GREY P O N T, with a Switch Tail, and a small Star on the Forehead. Any

Glossary availability cellular manufacturing closed queueing network coefficient of variation (CV) conditional probability CONWIP

Uncertainty Quantification in Performance Evaluation of Manufacturing Processes

Empirical Risk Minimization Algorithms

STOCHASTIC MODELS FOR RELIABILITY, AVAILABILITY, AND MAINTAINABILITY

Basic concepts of probability theory

Learning Methods for Linear Detectors

Predicting Target Events in Industrial Domains

Partially Observable Markov Decision Processes (POMDPs)

IN THIS paper we investigate the diagnosability of stochastic

Abstract. 1. Introduction

Fair Divsion in Theory and Practice

Performance Evaluation. Transient analysis of non-markovian models using stochastic state classes

Introduction to Mobile Robotics Probabilistic Robotics

Data Analyzing and Daily Activity Learning with Hidden Markov Model

Bayesian Networks Inference with Probabilistic Graphical Models

Reliability Engineering I

Multimedia Databases. Previous Lecture Video Abstraction Video Abstraction Example 6/20/2013

SVMs: Non-Separable Data, Convex Surrogate Loss, Multi-Class Classification, Kernels

University of Genova - DITEN. Smart Patrolling. video and SIgnal Processing for Telecommunications ISIP40

Methods for the specification and verification of business processes MPB (6 cfu, 295AA)

Kalman filtering and friends: Inference in time series models. Herke van Hoof slides mostly by Michael Rubinstein

2. Stochastic Time Petri Nets

Intro to Probability. Andrei Barbu

Two-Sample Inferential Statistics

NCDREC: A Decomposability Inspired Framework for Top-N Recommendation

Proceedings of the 2017 Winter Simulation Conference W. K. V. Chan, A. D Ambrogio, G. Zacharewicz, N. Mustafee, G. Wainer, and E. Page, eds.

Contents Lecture 4. Lecture 4 Linear Discriminant Analysis. Summary of Lecture 3 (II/II) Summary of Lecture 3 (I/II)

Partially Observable Markov Decision Processes (POMDPs) Pieter Abbeel UC Berkeley EECS

Acyclic discrete phase type distributions: properties and a parameter estimation algorithm

CS 347 Parallel and Distributed Data Processing

Recap. Probability, stochastic processes, Markov chains. ELEC-C7210 Modeling and analysis of communication networks

Homework 1/Solutions. Graded Exercises

A Probabilistic Relational Model for Characterizing Situations in Dynamic Multi-Agent Systems

Lecture 3: Policy Evaluation Without Knowing How the World Works / Model Free Policy Evaluation

MAT SYS 5120 (Winter 2012) Assignment 5 (not to be submitted) There are 4 questions.

Stochastic process. X, a series of random variables indexed by t

LIMITS FOR QUEUES AS THE WAITING ROOM GROWS. Bell Communications Research AT&T Bell Laboratories Red Bank, NJ Murray Hill, NJ 07974

J. MEDHI STOCHASTIC MODELS IN QUEUEING THEORY

A Gentle Introduction to Reinforcement Learning

CAP Plan, Activity, and Intent Recognition

Learning theory. Ensemble methods. Boosting. Boosting: history

Exact Inference by Complete Enumeration

Switching-state Dynamical Modeling of Daily Behavioral Data

Lecture notes: Rust (1987) Economics : M. Shum 1

A PARAMETRIC DECOMPOSITION BASED APPROACH FOR MULTI-CLASS CLOSED QUEUING NETWORKS WITH SYNCHRONIZATION STATIONS

Failure prognostics in a particle filtering framework Application to a PEMFC stack

CS 234 Midterm - Winter

Lecture 18: Noise modeling and introduction to decision theory

State-space Model. Eduardo Rossi University of Pavia. November Rossi State-space Model Financial Econometrics / 49

Transcription:

A stochastic model-based approach to online event prediction and response scheduling M. Biagi, L. Carnevali, M. Paolieri, F. Patara, E. Vicario Department of Information Engineering, University of Florence, Italy marco.biagi@unifi.it Chios, EPEW October 5, 26 This is about: prediction and scheduling in partially observable systems with a stochastic model-based approach / 23

Aim Partially observable system: any system where part of the state remains hidden to observations Diagnosis: estimate current state based on observations Prediction: evaluate future state probabilities given a distribution of initial current state Scheduling: select an action type and a time point in the future 2 / 23

An application context Ambient Assisted Living (AAL): aims at providing assistance to people living within smart environments through the integration and exploitation of new sensing technologies, data processing techniques, and services 3 / 23

An application context T.vanKasteren, A.Noulas, G.Englebienne, B.Krse, Accurate Activity Recognition in a Home Setting, UbiComp 8 4 / 23

Diagnosis given a stream of Observations (EventType and Timestamp), evaluate the current ongoing Activity of the subject and a measure of likelihood of such Activity diagnosis 2 provides a list of triplets π, x, R(t) where π [, ] is a likelihood measure x S is a state of the system R(t) is the PDF of the remaining sojourn time in x (estimated numerically on a grid of time points). 2 L.Carnevali, C.Nugent, F.Patara, E.Vicario, A Continuous-Time Model-Based Approach to Activity Recognition for Ambient Assisted Living QEST 5 5 / 23

Prediction an activity type is chosen as target the aim is to predict when the next instance of the target activity type will begin at each new diagnosis result, estimate the first-passage transient probabilities to the target activity 6 / 23

Prediction technique: a model for prediction model-based approach a predefined parametric structure: stochastic parameters derived automatically from the statistics in process mining terminology: enhancement without elicitation p xy idle xy act x idle yx p yx act y p xz idle zx idle p zy p yz zx p zy idle xz act z idle yz 7 / 23

Prediction technique: a model for prediction model structure is chosen so as to fit prediction needs take into account GEN state sojourn time distribution idle states are specialized, this allows us to: preserve memory of the foregoing activity (trade-off memory/complexity) duration depends on previous and following activities. Allows to have idles durations with a lower CV that is a good feature for prediction never observed states (e.g from dinner to dinner) are not included allowing to reduce the space of possible behaviors p xy idle xy act x idle yx p yx act y p xz idle zx idle p zy p yz zx p zy idle xz act z idle yz 8 / 23

Prediction technique: derivation of stochastic parameters sojourn time distributions are evaluated via a basic approximants fitting of expected value and variation coefficient 3 for CV >, a mixture of 2 EXP for > CV > 2 a sequence of 2 EXP for 2 CV, a shifted EXP transition probabilities p xy that activity act y will follow act x is estimated as p xy = { i < n α i = act x and α i+ = act y } { i < n α i = act x } 3 W. Whitt, Approximating a Point Process by a Renewal Process, I: Two Basic Methods, Operations Research, 982 9 / 23

Prediction technique: distribution to critical event π A =.7 A D E π B =.3 B C F for each current plausible state (A and B) evaluate the first-passage distribution to the critical state (gray state) for each possible path to the critical state, compose the remaining time in the current activity with the Semi Markov sojourn in subsequent activities / 23

Prediction technique: distribution to critical event π A =.7 A D E π B =.3 B C F for each current plausible state (A and B) evaluate the first-passage distribution to the critical state (gray state) 2 sum up first passage time distributions weighted by the diagnosed state likelihood PDF (t) A to F π A PDF (t) B to F π B PDF (t) to F..8.6.4.4.3 + =.2.8.6.4.2. t 2 3 4 5 2 3 4 5.2 t 2 3 4 5 t / 23

Scheduling Select and schedule a response action to be taken before or at the beginning of some target activity (e.g.: remind pills intake 5 minutes before a meal) 2 / 23

Scheduling: problem formulation 4 Response actuation time: response action must be executed t w time units before the beginning of the target activity. Response duration: response action is active for a fixed duration : target activities are successfully handled by the system if they start t w time units after the response action was active. dinner t ev t t w time E.g. pills intake reminder: t w = 3s = 2s Remind at time t 4 F. Salfner et al., A Survey of Online Failure Prediction Methods, ACM Comput. Surv., 2 3 / 23

Scheduling: reward and decision policy performance evaluation: based on the number of true positives TP, false positives FP and false negatives FN maximum probability interval policy: activation time is evaluated searching for the maximum probability interval of width if maximum probability is higher than a fixed threshold, schedule at t = x t w, where x is the start of the interval otherwise, no response action is scheduled dinner t ev t PDF (t) x t w t time 4 / 23

Scheduling: an example a = α, τ, δ e τ t τ + δ t () PDF (t) t w t a 2 = α 2, τ 2, δ 2 τ 2 τ 2 + δ 2 time 5 / 23

Scheduling: an example e e 2 τ t t2 τ + δ PDF (t) a = α, τ, δ PDF 2 (t) t w e 3 t3 t (2) t a 2 = α 2, τ 2, δ 2 e 4 e 5 τ 2 t4 t5 τ 2 + δ 2 time t () Schedule t () is updated at e 2 t w PDF 3 (t) I t (3) t w I 2 t Schedule t (2) results in a FP Schedule t (3) results in a TP t 6 / 23

Experiments: methodology two types of datasets considered: synthetic datasets real AAL dataset 5 compare also different fitting strategies for sojourn distributions: the mean value with an EXP distribution the mean value with a two phases Erlang distribution the mean value and CV by the Whitt approach 6 exact state diagnosis assumed as input, allowing to focus on predictor and scheduler performance results as precision = TP/(TP + FP) and recall = TP/(TP + FN) 5 T.vanKasteren, A.Noulas, G.Englebienne, B.Krse, Accurate Activity Recognition in a Home Setting, UbiComp 8 6 W. Whitt, Approximating a Point Process by a Renewal Process, I: Two Basic Methods, Operations Research, 982 7 / 23

Experiments: generating a synthetic datasets generated by simulation in order to analyze how real phenomena sojourn distributions affects system predictability predefined model structure x, y p x,y =.5 events with Exp() inter-times p, p,2 p 2,3 act idle, act idle,2 act 2 idle 2,3 act 3 p, p, p 2, p 3, idle, idle, idle 2, idle 3, 8-phase Erlang: Erlang(k = 8, λ = 4) µ = 2, CV = 8 2-phase Erlang: Erlang(k = 2, λ = ) µ = 2, CV = 2 2-phase Hyper-exp: hyper-exp µ = 2, CV.92 increase CV 8 / 23

Experiments: synthetic datasets results.9.8.7.6.5.4.3.2. Precision 5 5 2 t w = t w = 6 t w = 2 t w = 3.9.8.7.6.5.4.3.2. Recall 5 5 2 t w = t w = 6 t w = 2 t w = 3 precision and recall with Whitt fitting on 8-phase Erlang 9 / 23

Experiments: synthetic datasets results.9.8.7.6.5.4.3.2. Whitt Erlang Exp t w = t w = 6 t w = 2 t w = 3 5 5 2.9.8.7.6.5.4.3.2. t w = t w = 6 t w = 2 t w = 3 5 5 2.9.8.7.6.5.4.3.2. t w = t w = 6 t w = 2 t w = 3 5 5 2 comparing precisions of fitting strategy on 8-phase Erlang synthetic dataset 2 / 23

Experiments: synthetic datasets results.9.8.7.6.5.4.3.2. 8-ph Erlang CV = 8 2-ph Erlang CV = 2 Hyper-exp CV.92 t w = t w = 6 t w = 2 t w = 3 5 5 2.9.8.7.6.5.4.3.2. t w = t w = 6 t w = 2 t w = 3 5 5 2.9.8.7.6.5.4.3.2. t w = t w = 6 t w = 2 t w = 3 5 5 2 comparing precisions with Whitt fitting on different synthetic sojourn distributions 2 / 23

Experiments: real AAL dataset results increase F F = 2 (precision recall) (precision+recall) Preparing dinner Preparing breakfast Preparing breakfast (filtered) (s) 5 3 6 2 8 /.63/.533.265/.684.52/.684 /.86 /.86 6 /.5/.25.2/.385.2/.385.68/.78.68/.78 t w (s) 2 /.2/.385.2/.385.2/.385.68/.78.68/.78 3 / / / /.48/.6.48/.6 t w (s) t w (s) /.93/.385.74/.385.74/.385.929/.448.929/.448 6 /.24/.36.24/.36.24/.36.462/.25.462/.25 2 / / / /.24/.36.24/.36 3 / / / /.2/.36.2/.36 /.465/.74.489/.759.557/.88.66/.87.672/.93 6 /.222/.556.39/.667.33/.735.48/.828.476/.896 2 /.63/.26.28/.467.225/.69.288/.778.338/.839 3 /.53/.28.88/.429.78/.643.277/.8.329/.836 22 / 23

Conclusion and future development we presented a model-based approach to online event prediction and response scheduling fitting both sojourn times mean value and CV improves precision of the prediction performance of the technique depends on the predictability of the system this technique can be applied to AAL for small values of response actuation time future evolution often depends on the history of previous activities, breaking the hypothesis of semi-markov models performance achieved by different model structures will be investigated in the future applicability beyond the limits of AAL perhaps better suited to more structured and predictable systems e.g. monitoring of industrial system e.g. gas/water distribution network 23 / 23

24 / 23

An application context Observed: Open fridge, open bathroom door Hidden states: Dinner, toileting,... Floorplan of the house, red rectangle boxes indicate sensor nodes 7 7 T.vanKasteren, A.Noulas, G.Englebienne, B.Krse, Accurate Activity Recognition in a Home Setting, UbiComp 8 25 / 23

A reference dataset for AAL a sequence of Observations with manually annotated Activities 8 7+ Activity Types from the index of Activities of Daily Living 28 Event Types (On Off of 4 state-change Sensors) 28 days, 245 annotated Activities and 29 unclassified intervals (Idle), 2638 Observations,... a Idling k- a Idling k a k+ δ δ k-,k k e n-u e n-u+ e n- e n e n+ e n+h............... end t k- t k start t k end start t k+ single subject, no concurrent Activities, reliable sensors, some inaccuracies start and completion of Activities do not coincide with Observations (and they are thus not observable) 8 T.vanKasteren, A.Noulas, G.Englebienne, B.Krse, Accurate Activity Recognition in a Home Setting, UbiComp 8 26 / 23

Prediction technique: a model for prediction Activity µ CV bed 26832.3.43 toilet 88.82.87 breakfast 5.5.72 shower 485.7.3 leave 39764.94.6 drink 35.25.35 dinner 862.3.63 idle 88.77 2. Activity µ CV idle bed 56.2.52 idle toilet 377.69.47 idle breakfast 244.5.58 idle shower 22.87 2.8 idle leave 687.33.54 idle drink 363.75.86 idle dinner 735.3.76 Activity µ CV idle bed toilet 434.75.9 idle bed breakfast 863.5.87 idle toilet bed 58.32 2.4 idle toilet toilet 3568.56.8 idle toilet breakfast 8.93.5 idle toilet shower 463.4.9 idle toilet leave 595.45.53 idle toilerink 364.5.97 idle toileinner 439.33.97 idle breakfast toilet 47.75.66 idle breakfast shower 53.73.38 idle breakfasrink 425.. idle shower toilet 2.. idle shower leave 722.38.73 idle shower dinner 2525.. idle leave bed 22.. idle leave toilet 82.96.5 idle leave shower 75..92 idle leave leave 359.. idle leave drink 28.67.2 idle leave dinner 76.. idle drink toilet 37.69.86 idle drink breakfast 8.. idle drink drink 4282..68 idle dinner toilet 6256..76 idle dinner leave 3474.. idle dinner drink 95.29.68 27 / 23

Prediction technique: distribution to critical event π A =.7 A p AD D E p AC π B =.3 B C F for each current plausible state (A and B) evaluate the first-passage distribution to the critical state (gray state) 9 A. Horváth, M. Paolieri, L. Ridi, E.Vicario, Transient analysis of non-markovian models using stochastic state classes Perform. Evaluation, 22. 28 / 23

Prediction technique: distribution to critical event π A =.7 A p AD D E p AC π B =.3 B C F for each current plausible state (A and B) evaluate the first-passage distribution to the critical state (gray state) mark critical state as absorbing 9 A. Horváth, M. Paolieri, L. Ridi, E.Vicario, Transient analysis of non-markovian models using stochastic state classes Perform. Evaluation, 22. 29 / 23

Prediction technique: distribution to critical event π A =.7 A p AD D E p AC π B =.3 B C F for each current plausible state (A and B) evaluate the first-passage distribution to the critical state (gray state) mark critical state as absorbing 2 evaluate next states reached after current state (C and D from A) 9 A. Horváth, M. Paolieri, L. Ridi, E.Vicario, Transient analysis of non-markovian models using stochastic state classes Perform. Evaluation, 22. 3 / 23

Prediction technique: distribution to critical event π A =.7 A p AD D E p AC π B =.3 B C F for each current plausible state (A and B) evaluate the first-passage distribution to the critical state (gray state) mark critical state as absorbing 2 evaluate next states reached after current state (C and D from A) 3 P DF (t) and P CF (t) are the distributions from next states to the critical state and was evaluated offline with transient analysis 9 9 A. Horváth, M. Paolieri, L. Ridi, E.Vicario, Transient analysis of non-markovian models using stochastic state classes Perform. Evaluation, 22. 3 / 23

Prediction technique: distribution to critical event π A =.7 A p AD D E p AC π B =.3 B C F for each current plausible state (A and B) evaluate the first-passage distribution to the critical state (gray state) mark critical state as absorbing 2 evaluate next states reached after current state (C and D from A) 3 P DF (t) and P CF (t) are the distributions from next states to the critical state and was evaluated offline with transient analysis 9 4 evaluate convolution between R A (t) and P CF (t) weighted by p AC 9 A. Horváth, M. Paolieri, L. Ridi, E.Vicario, Transient analysis of non-markovian models using stochastic state classes Perform. Evaluation, 22. 32 / 23

Prediction technique: distribution to critical event π A =.7 A p AD D E p AC π B =.3 B C F for each current plausible state (A and B) evaluate the first-passage distribution to the critical state (gray state) mark critical state as absorbing 2 evaluate next states reached after current state (C and D from A) 3 P DF (t) and P CF (t) are the distributions from next states to the critical state and was evaluated offline with transient analysis 9 4 evaluate convolution between R A (t) and P CF (t) weighted by p AC 5 evaluate convolution between R A (t) and P DF (t) weighted by p AD 9 A. Horváth, M. Paolieri, L. Ridi, E.Vicario, Transient analysis of non-markovian models using stochastic state classes Perform. Evaluation, 22. 33 / 23

Prediction technique: distribution to critical event π A =.7 A p AD D E p AC π B =.3 B C F for each current plausible state (A and B) evaluate the first-passage distribution to the critical state (gray state) mark critical state as absorbing 2 evaluate next states reached after current state (C and D from A) 3 P DF (t) and P CF (t) are the distributions from next states to the critical state and was evaluated offline with transient analysis 9 4 evaluate convolution between R A (t) and P CF (t) weighted by p AC 5 evaluate convolution between R A (t) and P DF (t) weighted by p AD 6 their sum is the first passage time distribution from A to F 9 A. Horváth, M. Paolieri, L. Ridi, E.Vicario, Transient analysis of non-markovian models using stochastic state classes Perform. Evaluation, 22. 34 / 23

Experiments: synthetic datasets results.9.8.7.6.5.4.3.2. Precision t w = t w = 6 t w = 2 t w = 3 5 5 2.9.8.7.6.5.4.3.2. Recall t w = t w = 6 t w = 2 t w = 3 5 5 2 Whitt.9.8.7.6.5.4.3.2. t w = t w = 6 t w = 2 t w = 3 5 5 2.9.8.7.6.5.4.3.2. t w = t w = 6 t w = 2 t w = 3 5 5 2 EXP comparing fitting strategy on 8-phase Erlang synthetic dataset.9.8.7.6.5.4.3.2. t w = t w = 6 t w = 2 t w = 3 5 5 2.9.8.7.6.5.4.3.2. t w = t w = 6 t w = 2 t w = 3 5 5 2 Erlang 35 / 23

Experiments: synthetic datasets results.9.8.7.6.5.4.3.2. Precision t w = t w = 6 t w = 2 t w = 3 5 5 2.9.8.7.6.5.4.3.2. Recall t w = t w = 6 t w = 2 t w = 3 5 5 2 8-ph Erlang.9.8.7.6.5.4.3.2. t w = t w = 6 t w = 2 t w = 3 5 5 2.9.8.7.6.5.4.3.2. t w = t w = 6 t w = 2 t w = 3 5 5 2 2-ph Erlang comparing results with Whitt fitting on different underlying distributions.9.8.7.6.5.4.3.2. 5 5 2 t t w = t w = 6 t w = 2 t w = 3.9.8.7.6.5.4.3.2. 5 5 2 t t w = t w = 6 t w = 2 t w = 3 Hyper-exp 36 / 23