Polynomial-Time Verification of PCTL Properties of MDPs with Convex Uncertainties and its Application to Cyber-Physical Systems

Similar documents
Data-Driven Probabilistic Modeling and Verification of Human Driver Behavior

Data-Driven Probabilistic Modeling and Verification of Human Driver Behavior

Polynomial-Time Verification of PCTL Properties of MDPs with Convex Uncertainties

Polynomial-Time Verification of PCTL Properties of MDPs with Convex Uncertainties

Polynomial-Time Verification of PCTL Properties of MDPs with Convex Uncertainties

SFM-11:CONNECT Summer School, Bertinoro, June 2011

Probabilistic Model Checking Michaelmas Term Dr. Dave Parker. Department of Computer Science University of Oxford

Probabilistic Model Checking and Strategy Synthesis for Robot Navigation

PRISM An overview. automatic verification of systems with stochastic behaviour e.g. due to unreliability, uncertainty, randomisation,

ONR MURI AIRFOILS: Animal Inspired Robust Flight with Outer and Inner Loop Strategies. Calin Belta

Reasoning about Time and Reliability

Probabilistic model checking with PRISM

Motion Planning for LTL Specifications: A Satisfiability Modulo Convex Optimization Approach

DryVR: Data-driven verification and compositional reasoning for automotive systems

ROBUST & SPECULATIVE BYZANTINE RANDOMIZED CONSENSUS WITH CONSTANT TIME COMPLEXITY IN NORMAL CONDITIONS

Verifying Robustness of Human-Aware Autonomous Cars

Probabilistic Model Checking Michaelmas Term Dr. Dave Parker. Department of Computer Science University of Oxford


Preface. Motivation and Objectives

Verifying Randomized Distributed Algorithms with PRISM

Algorithmic Verification of Stability of Hybrid Systems

Caching Strategies for Run-time Probabilistic Model Checking

On Verification and Controller Synthesis for Probabilistic Systems at Runtime

Decentralized Control of Stochastic Systems

Controlling probabilistic systems under partial observation an automata and verification perspective

Monitoring the full range of ω-regular properties of Stochastic Systems

Trusted Machine Learning: Model Repair and Data Repair for Probabilistic Models

Estimation and Optimization: Gaps and Bridges. MURI Meeting June 20, Laurent El Ghaoui. UC Berkeley EECS

Models for Control and Verification

Combining Shared Coin Algorithms

Robust Control of Uncertain Markov Decision Processes with Temporal Logic Specifications

Synthesis for Human-in-the-Loop Control Systems

Semidefinite and Second Order Cone Programming Seminar Fall 2012 Project: Robust Optimization and its Application of Robust Portfolio Optimization

Distributed Optimization. Song Chong EE, KAIST

Stochastic, Hybrid and Real-Time Systems: From Foundations To Applications with Modest

Safe Control under Uncertainty

Bounded Model Checking with SAT/SMT. Edmund M. Clarke School of Computer Science Carnegie Mellon University 1/39

Probabilistic verification and approximation schemes

Probabilistic Model Checking Michaelmas Term Dr. Dave Parker. Department of Computer Science University of Oxford

Planning Under Uncertainty II

Variations on Itai-Rodeh Leader Election for Anonymous Rings and their Analysis in PRISM

Model, Data and Reward Repair: Trusted Machine Learning for Markov Decision Processes

Resilient Formal Synthesis

Timo Latvala. March 7, 2004

A Brief Introduction to Model Checking

Analysing Randomized Distributed Algorithms

Analysis and synthesis: a complexity perspective

Basics of Uncertainty Analysis

Using Theorem Provers to Guarantee Closed-Loop Properties

Alan Bundy. Automated Reasoning LTL Model Checking

CS Communication Complexity: Applications and New Directions

IE598 Big Data Optimization Introduction

Stochastic Reachable Sets of Interacting Traffic Participants

Probabilistic Model Checking: Advances and Applications

Statistical Model Checking Applied on Perception and Decision-making Systems for Autonomous Driving

Practicable Robust Markov Decision Processes

Computation Tree Logic (CTL) & Basic Model Checking Algorithms

Safety Verification of Fault Tolerant Goal-based Control Programs with Estimation Uncertainty

Verification and Control of Partially Observable Probabilistic Systems

A Learning Based Approach to Control Synthesis of Markov Decision Processes for Linear Temporal Logic Specifications

10 Robotic Exploration and Information Gathering

Ranking Verification Counterexamples: An Invariant guided approach

Introduction to Embedded Systems

Evaluating the Reliability of Defect-Tolerant Architectures for Nanotechnology with Probabilistic Model Checking

Uncertainty modeling for robust verifiable design. Arnold Neumaier University of Vienna Vienna, Austria

Integrating Induction and Deduction for Verification and Synthesis

A Bounded Statistical Approach for Model Checking of Unbounded Until Properties

Formal Verification via MCMAS & PRISM

Synthesizing from Components: Building from Blocks

ProbVerus: Probabilistic Symbolic Model Checking

Revisiting Synthesis of GR(1) Specifications

arxiv: v1 [math.oc] 11 Jan 2018

Connected Cruise Control Design Using Probabilistic Model Checking

arxiv: v2 [math.oc] 27 Aug 2018

Dynamic and Adversarial Reachavoid Symbolic Planning

Probabilistic and Logistic Circuits: A New Synthesis of Logic and Machine Learning

From Secure MPC to Efficient Zero-Knowledge

Formal verification of One Dimensional Time Triggered Velocity PID Controllers Kenneth Payson 12/09/14

Chapter 5: Linear Temporal Logic

On the Synergy of Probabilistic Causality Computation and Causality Checking

LTL Control in Uncertain Environments with Probabilistic Satisfaction Guarantees

Compositional Verification of Probabilistic Systems using Learning

Probabilistic model checking with PRISM

Modeling and Analysis of Probabilistic Timed Systems

Probabilistic Model Checking of Randomised Distributed Protocols using PRISM

Automated Verification and Strategy Synthesis for Probabilistic Systems

Runtime Verification of Stochastic, Faulty Systems

Autonomous Agent Behaviour Modelled in PRISM A Case Study

Projection in Logic, CP, and Optimization

Introduction p. 1 Fundamental Problems p. 2 Core of Fundamental Theory and General Mathematical Ideas p. 3 Classical Statistical Decision p.

Safe Autonomy Under Perception Uncertainty Using Chance-Constrained Temporal Logic

The State Explosion Problem

15-780: LinearProgramming

PRISM: Probabilistic Model Checking for Performance and Reliability Analysis

Multi-Objective Planning with Multiple High Level Task Specifications

Facilitating Formal Verification of Cooperative Driving Applications: Techniques and Case Study. Shou-pon Lin

Compositional Reasoning

Module 8 Linear Programming. CS 886 Sequential Decision Making and Reinforcement Learning University of Waterloo

State Explosion in Almost-Sure Probabilistic Reachability

Chapter 13: Model Checking Linear-Time Properties of Probabilistic Systems

Transcription:

Polynomial-Time Verification of PCTL Properties of MDPs with Convex Uncertainties and its Application to Cyber-Physical Systems Alberto Puggelli DREAM Seminar - November 26, 2013 Collaborators and PIs: Wenchao Li Dorsa Sadigh Katherine Driggs Campbell A. L. Sangiovanni-Vincentelli S. A. Seshia

Goal of this talk l Spur collaborations with other researchers in the department Developed theoretical framework Developed (prototype) tool implementation Now it is time to apply the framework to relevant case studies l Success stories Verification of human driver behavior (D. Sadigh, K. Driggs Campbell) On-going integration of the algorithms within PRISM (state-of-the-art tool developed at the University of Birmingham and Oxford University, UK)

Verify a Hybrid World with Uncertainties Sensor Networks SoC Power Management Biochemical Synthesis Need to formally verify and quantitatively analyze system performances in the presence of uncertainties (unmodeled dynamics, errors in parameter estimation, faulty and malicious behaviors) Stock Market Exchange Renewables Scheduling Robot Path Planning 2

Behavior of a Human Driver Crash The driver will always eventually perform the maneuver correctly FALSE The driver will perform the maneuver correctly with probability higher than 90% TRUE

Setup for Model Training courtesy of V. Vasudevan, K. Driggs Campbell, G. Juniwal l Intrinsic uncertainties in modeling the human behavior! l How can we account for this at verification time?

Two More Steps Towards the Goal Verify a Hybrid World with Uncertainties [Chatterjee et al. 08] PCTL verification for Interval-MDPs is at most in co-np [Kwiatkowska et al. 00] PRISM: Algorithms and Tool for PCTL verification of MDPs [Bianco 95-Courcoubetis 95] Verification algorithms for Markov Decision Processes (MDPs) [Puggelli et al. 13] Polynomial-time algorithm for PCTL verification of Convex-MDPs [Kozine et al. 02] Interval-MDP: Interval Uncertainties in transition probabilities of MDPs [Hansson et al. 94] Probabilistic Computation Tree Logic (PCTL) 4

Outline l Background Convex-MDP: MDP with Convex Uncertainty Sets Probabilistic Computation Tree Logic (PCTL) l Polynomial-Time Verification Algorithm 1 l Case Studies Randomized Consensus Protocol ZeroConf Protocol Behavior of a Human Driver 2 1. A. Puggelli et al., Proceedings of CAV2013 2. D. Sadigh et al., submitted to AAAI 2014 Symposium 5

Outline l Background Convex-MDP: MDP with Convex Uncertainty Sets Probabilistic Computation Tree Logic (PCTL) l Polynomial-Time Verification Algorithm 1 l Case Studies Randomized Consensus Protocol ZeroConf Protocol Behavior of a Human Driver 2 1. A. Puggelli et al., Proceedings of CAV2013 2. D. Sadigh et al., submitted to AAAI 2014 Symposium 5

f 2 Convex-MDP.8.7 S 0 START S 1 f 2 =.9 WORK a f 2 [.7-.8] Interval (.23,.77) (.28,.72).2.3 f 1 =.1 f U f 1 +f 2 =1 f 1 f 1 =.2 b f 2 S 2 FAIL f 1 =.8 f 1 [.2-.3] S 3 DONE Ellipsoidal f 1 +f 2 =1 l Action chosen by an Adversary l Transition probability distribution chosen by Nature l Transition probabilistically executed f 1 f 1 Likelihood f 1 +f 2 =1 A. Nilim, Robust Control of Markov Decision Processes with Uncertain Transition Matrices, 2005 6 f 2

Semantics of Convex-MDPs (1) S 0 START f 1 =.1 S 1 f 2 =.9 WORK a f 2 [.7-.8] f 1 =.2 b S 2 FAIL f 1 =.8 f 1 [.2-.3] Path S 3 DONE l Action chosen by an Adversary l Transition probability distribution chosen once by Nature l Transition probabilistically executed a, S 0 S [0.25,0.75] a, 1 S [0.25,0.75] 1 S 1 Static process: E.g. Variability in integrated circuits S 3 b, [0.2,0.8] 6

Semantics of Convex-MDPs (2) S 0 START f 1 =.1 S 1 f 2 =.9 WORK a f 2 [.7-.8] f 1 =.2 b S 2 FAIL f 1 =.8 f 1 [.2-.3] Path S 3 DONE l Action chosen by an Adversary l Transition probability distribution chosen at each step by Nature l Transition probabilistically executed a, S 0 S [0.25,0.75] a, 1 S [0.22,0.78] 1 S 1 Time or space varying process: E.g. Quality of a wireless link S 3 b, [0.2,0.8] 6

Probabilistic Computation Tree Logic l Logic syntax Next Bounded Until l Logic semantics Unbounded Until l Verification algorithm: solve the optimization problem 7

Which Logic to Use? l Qualitative logics (LTL, CTL): Pros: efficient algorithms, Cons: only give yes/no answers l Quantitative logics: PCTL Pros: efficient algorithms, enables quantitative analysis Cons: can t express arbitrary liveness and fairness properties ω-pctl 1 Pros: quantitative analysis, express safety, liveness, fairness Cons: no efficient algorithm 1. K. Chatterjee et al., Model-Checking ω-regular Properties of Interval Markov Chains, TACAS 2008

Outline l Background Convex-MDP: MDP with Convex Uncertainty Sets Probabilistic Computation Tree Logic (PCTL) l Polynomial-Time Verification Algorithm 1 l Case Studies Randomized Consensus Protocol ZeroConf Protocol Behavior of a Human Driver 2 1. A. Puggelli et al., Proceedings of CAV2013 2. D. Sadigh et al., submitted to AAAI 2014 Symposium 5

New Results in Theoretical Complexity l Size of Convex-MDP R = O(#States x #Transitions x #Actions) l Size of PCTL formula Q = O(#Operators(excluding U k ) + #(U k ) x k max ) PCTL Operator Puggelli 13 Verification Complexity Chatterjee 08* In R In Q In R In Q Qualitative P P P P Next (X) P P co-np P Bounded Until (U k ) P Pseudo-P in k max - - Unbounded Until (U) P P co-np P *Only interval uncertainties 8

Unbounded Until in Markov Chains x i 1. = Probability of satisfying for i-th state 2. Set up and solve the system of equations! x 2 = 0 # x 3 =1 " # x 0 = 0.9 + 0.1x 2 $ # = 0.2 + 0.7x 3 + 0.1x 2 3. #Equations = #States = N 4. Algorithmic complexity O(N 3 ) è Polynomial in R 9

Unbounded Until in MDP 1. Need to consider the worst-case adversary (historyless-deterministic enough) 2. Set up and solve the linear program max x s.t. x i x 2 = 0 x 3 =1 0.3 + 0.7x 3 0.2x 2 + 0.8x 3 x 0 = 0.9 + 0.1x 2 3. #Constraints = O(#States x #Actions) 4. Interior Point è Algorithmic complexity polynomial in R 10

Unbounded Until in Convex-MDP 1. Need to consider the worst-case adversary and nature 2. Set up the optimization problem max x x i s.t. x 2 = 0 x 3 =1 x 0 = 0.9 + 0.1x 2 0.2x 2 + 0.8x 3 min f U f 1 x 3 + f 2 3. The adversarial nature minimizes the upper bound on 4. To maintain convexity, need to add one constraint f U 5. Uncountably infinite number of constraints: cannot solve x i 11

Unbounded Until in Convex-MDP l Try all Probability Distributions? max x x i s.t. x 2 = 0 x 3 =1 x 0 = 0.9 + 0.1x 2 0.2 + 0.7x 3 + 0.1x 2 S 0 START f 2 =.9 f 1 =.1 f 2 [.7-.8] f 1 =.2 S b 1 WORK a S 2 FAIL f 1 =.8 f 1 [.2-.3] S 3 DONE 0.2 + 0.8x 3 0.25 + 0.75x 3 0.21 + 0.79x 3... l NO: Uncountably infinite number of distributions f 2.8.7 Interval.2.3 f U f 1 +f 2 =1 f 1

Dual Transformation for the Inner Problem l Primal Problem σ (x ) = min f U f 1 x 3 + f 2 l Dual Problem Convex Number of dual variables and constraints is polynomial in R Strong duality holds: d(x ) = max λ D g(λ, x ) σ (x ) g(λ, x ) λ d(x D ) = σ (x ) Cost Primal Cost Function σ f, λ Upper bound on x i d Dual Cost Function 12

New Formulation Original formulation max x s.t. x 2 = 0 x 3 =1 x i x 0 = 0.9 + 0.1x 2 Dual transformation of the inner problems max x s.t. x i x 2 = 0 x 3 =1 x 0 = 0.9 + 0.1x 2 New formulation (drop all inner problems) max x,λ s.t. x 2 = 0 x 3 =1 x i x 0 = 0.9 + 0.1x 2 0.2x 2 + 0.8x 3 min f U f 1 x 3 + f 2 0.2x 2 + 0.8x 3 max λ D g(λ, x) 0.2x 2 + 0.8x 3 g(λ, x) λ D Primal Unbounded Until can be verified by solving one convex problem with a number of variables and constraints polynomial in R. f, λ Dual 13

Outline l Background Convex-MDP: MDP with Convex Uncertainty Sets Probabilistic Computation Tree Logic (PCTL) l Polynomial-Time Verification Algorithm 1 l Case Studies Randomized Consensus Protocol ZeroConf Protocol Behavior of a Human Driver 2 1. A. Puggelli et al., Proceedings of CAV2013 2. D. Sadigh et al., submitted to AAAI 2014 Symposium 5

Randomized Consensus Protocol [Aspnes 90] f H = 0.5 f T = 0.5 P2 f H = 0.5 f T = 0.5 P1 C P4 P3 f H [0.4 0.6] f T [0.4 0.6] l Study the probability of agreement in a network of asynchronous processes l Uncertainty models a faulty/compromised process which tosses a biased coin f H = 0.5 f T = 0.5 15

Randomized Consensus Protocol l With fair coins, the probability of agreement increases for increasing protocol rounds l In the presence of uncertainty, increasing the protocol rounds instead decreases the probability of agreement The proposed analysis allows a better tuning of protocol parameters to accommodate for faulty/compromised processes 16

Runtime Analysis l Use MOSEK as background LP solver l Size of the convex problem and runtime scale polynomially l Comparable with PRISM 2 and 1000x faster than PARAM 3 1. www.mosek.com 2. Kwiatkowska et al., PRISM 4.0: Verification of Probabilistic Real-time Systems 3. Hahn et al., Synthesis for PCTL in Parametric Markov Decision Processes

ZeroConf Protocol [Cheshire 05] l Study the QoS of a network configuration protocol for domotic applications l Model the network as a Timed Automata (source: doip.org) Likelihood l Maximum likelihood estimator to model the losses in the (physical) wireless channel f 1 +f 2 =1 f 1 f 2 18

ZeroConf Protocol Probability of failure High Uncertainty Low Uncertainty l Probability of failing to register to the network within a preset deadline l Analysis with no uncertainties largely underestimates the probability of failure Our analysis enables a robust configuration of protocol parameters to fit variable conditions of operation 19

Why Modeling the Driver Behavior? More effective teaching strategies Collision avoidance Lane changing Assisted maneuvers (Semi)-Autonomous Driving Driving regulations and insurance terms

Data Collection l Focus on modeling differences between attentive and distracted driving 1 Scenario 1: No distraction, no obstacle Scenario 2: Distraction, no obstacle Distraction Scenario 3: No distraction, Obstacle Obstacle Scenario 4: Distraction, Obstacle Distraction Obstacle 1. V. Vasudevan et al., Safe Semi-Autonomous Control with Enhanced Driver Modeling, ACC 2012 t

Library of Atomic Behaviors l Library of atomic labels L = {distracted, attentive, swerving, braking, accelerating, right lane, left lane } l Modes 2 L E.g. m 1 =(distracted, right lane) l Goal: Predict vehicle trajectories for each mode l Measured inputs: Driver steering angle (every 30ms) Driver pose -> proxy for attention level l Cluster measured inputs into the available atomic modes l For each mode, use a model of vehicle dynamics to predict possible trajectories for 1.2s Example: Mode = (right lane, straight, distracted)

Model Creation l Modes are interpreted as states of the Convex-MDP l Transition probabilities are computed based on empirical frequencies of trajectory end-points.

Analysis of a Complex Maneuver l Repeat the process to build the model of a complex maneuver

Verified Properties l Evaluating different driving styles l Estimating probability of threats

Comparison among Uncertainty Models P max [attentive U unsafe] P min [right lane U final] l With no uncertainty, results might be overly optimistic l Both uncertainty models trained with 95% confidence Interval model might be overly pessimistic Likelihood model is a statistically-valid compromise

Sensitivity to the Uncertainty Level P max [attentive U unsafe] l Attentive driver always perform better (gap varies among individuals!) l Depending on the specification, a different level of confidence is required -> guide on how to train the model!

Characterization of Individual Driving Styles l Compare driving styles S2 worst on keeping the right lane S3 brakes less often l The presence of an obstacle always increases the probability of threats

Conclusions and Future Work l Proposed a polynomial time algorithm for the verification of PCTL properties of MDPs l Lowered theoretical complexity for Interval-MDPs from co-np to P and extended to a large class of non-linear convex models of uncertainty l Applied to the verification of the behavior of a human driver l Application to further case studies (e.g. pricing of renewable energy) l Theory extensions: Continuous-Time Markov Chains Compositional methods (assume-guarantee) Stochastic control Source code available at: http://www.eecs.berkeley.edu/~puggelli/ 20

Runtime Analysis l Use MOSEK as background convex solver l Size of the convex problem and runtime scale polynomially l Comparable with PRISM 2 and 1000x faster than PARAM 3 1. www.mosek.com 2. Kwiatkowska et al., PRISM 4.0: Verification of Probabilistic Real-time Systems 3. Hahn et al., Synthesis for PCTL in Parametric Markov Decision Processes 17

Unbounded Until in Convex-MDP S 0 START f 2 =.9 f 2 =.3 f 1 =.1 f 1 =.2 S b 1 WORK a f 1 =.7 S 2 FAIL f 1 =.8 S 3 DONE

Unbounded Until in Convex-MDP l Try all Probability Distributions? max x x i s.t. x 2 = 0 S 2 x 3 =1 x 0 = 0.9 + 0.1x 2 0.2 + 0.7x 3 + 0.1x 2 0.2 + 0.8x 3 0.25 + 0.75x 3 0.21 + 0.79x 3... S 0 START f 2 =.9 f 1 =.1 f 2 [.7-.8] f 1 =.2 S b 1 WORK l NO: Uncountably infinite number of distributions a FAIL f 1 =.8 f 1 [.2-.3] S 3 DONE

Until Operator in CMDPs: Duality max x s.t. x i x 2 = 0 x 3 =1 x 0 = 0.9 + 0.1x 2 0.2 + 0.7x 3 + 0.1x 2 0.2 + 0.8x 3 0.25 + 0.75x 3 0.21 + 0.79x 3... max x x i s.t. x 2 = 0 l Worst-case: Minimize the upper bound l x 3 =1 x 0 = 0.9 + 0.1x 2 min p U1 p 1,1 + p 1,2 x 2 + p 1,3 x 3 min p U2 p 2,1 + p 2,3 x 3 min p U1 p 1,1 + p 1,2 x 2 + p 1,3 x 3 Primal problem

Until Operator in CMDPs: Duality-Theory Approach max x x i s.t. x 2 = 0 x 3 =1 x 0 = 0.9 + 0.1x 2 min p U1 p 1,1 + p 1,2 x 2 + p 1,3 x 3 min p U2 p 2,1 + p 2,3 x 3 max x x i s.t. x 2 = 0 x 3 =1 x 0 = 0.9 + 0.1x 2 max λ1 D 1 g 1 (λ 1, x) max λ2 D 2 g 2 (λ 2, x) l Substitute each primal problem with the corresponding dual problem