Provably Correct Persistent Surveillance for Unmanned Aerial Vehicles Subject to Charging Constraints

Similar documents
Provably Correct Persistent Surveillance for Unmanned Aerial Vehicles Subject to Charging Constraints

Persistent Surveillance for Unmanned Aerial Vehicles Subject to Charging and Temporal Logic Constraints

Persistent surveillance for unmanned aerial vehicles subject to charging and temporal logic constraints

Dynamic Routing of Energy-Aware Vehicles with Temporal Logic Constraints

Distributed Multi-Agent Persistent Surveillance Under Temporal Logic Constraints

Receding Horizon Control in Dynamic Environments from Temporal Logic Specifications

Correct-by-Construction Control Synthesis for Multi-Robot Mixing

ONR MURI AIRFOILS: Animal Inspired Robust Flight with Outer and Inner Loop Strategies. Calin Belta

Op#mal Control of Nonlinear Systems with Temporal Logic Specifica#ons

Time-Constrained Temporal Logic Control of Multi-Affine Systems

Nonlinear Landing Control for Quadrotor UAVs

Optimal Event-Driven Multi-Agent Persistent Monitoring of a Finite Set of Targets

Synthesis of Distributed Control and Communication Schemes from Global LTL Specifications

Dynamic and Adversarial Reachavoid Symbolic Planning

Online Task Planning and Control for Aerial Robots with Fuel Constraints in Winds

arxiv: v1 [cs.sy] 26 Mar 2012

Optimal Multi-Robot Path Planning with LTL Constraints: Guaranteeing Correctness Through Synchronization

16.410/413 Principles of Autonomy and Decision Making

arxiv: v1 [cs.ro] 12 Mar 2019

LTL Control in Uncertain Environments with Probabilistic Satisfaction Guarantees

Memoryless Control Design for Persistent Surveillance under Safety Constraints

EE C128 / ME C134 Feedback Control Systems

Analysis and Design of Hybrid AI/Control Systems

Decentralized Stabilization of Heterogeneous Linear Multi-Agent Systems

Planning Under Uncertainty II

Autonomous Agent Behaviour Modelled in PRISM A Case Study

Optimal Multi-Robot Path Planning with LTL Constraints: Guaranteeing Correctness Through Synchronization

Route-Planning for Real-Time Safety-Assured Autonomous Aircraft (RTS3A)

Quadrotor Modeling and Control

Adaptive Robust Control (ARC) for an Altitude Control of a Quadrotor Type UAV Carrying an Unknown Payloads

Synthesis of Reactive Switching Protocols from Temporal Logic Specifications

The Multi-Agent Rendezvous Problem - The Asynchronous Case

Optimal Control of Mixed Logical Dynamical Systems with Linear Temporal Logic Specifications

Optimal Control of Markov Decision Processes with Temporal Logic Constraints

Multi-robot Persistent Coverage with Stochastic Task Costs

MDP Optimal Control under Temporal Logic Constraints - Technical Report -

Counterexamples for Robotic Planning Explained in Structured Language

Intermittent Connectivity Control in Mobile Robot Networks

arxiv: v1 [cs.sy] 8 Mar 2017

arxiv: v2 [cs.ro] 10 Jul 2012

Exploiting System Structure in Formal Synthesis

Probabilistic Model Checking and Strategy Synthesis for Robot Navigation

Hierarchical Hybrid Symbolic Robot Motion Planning and Control

Formal Synthesis of Embedded Control Software: Application to Vehicle Management Systems

Cooperative Decentralized Multi-agent Control under Local LTL Tasks and Connectivity Constraints

Assistive Collision Avoidance for Quadrotor Swarm Teleoperation

Preface. Motivation and Objectives

Anytime Planning for Decentralized Multi-Robot Active Information Gathering

Sub-Optimal Scheduling of a Flexible Batch Manufacturing System using an Integer Programming Solution

Decomposition of planning for multi-agent systems under LTL specifications

Georgios E. Fainekos, Savvas G. Loizou and George J. Pappas. GRASP Lab Departments of CIS, MEAM and ESE University of Pennsylvania

Automata-based Verification - III

Alan Bundy. Automated Reasoning LTL Model Checking

Synthesis of Reactive Switching Protocols From Temporal Logic Specifications

Mdp Optimal Control under Temporal Logic Constraints

TEMPORAL LOGIC [1], [2] is the natural framework for

CHAPTER 3: INTEGER PROGRAMMING

Monitoring using Heterogeneous Autonomous Agents

The PVTOL Aircraft. 2.1 Introduction

Energy-Aware Coverage Path Planning of UAVs

Optimal Control of MDPs with Temporal Logic Constraints

Synthesis of Correct-by-Construction Behavior Trees

Planning Persistent Monitoring Trajectories in Spatiotemporal Fields using Kinodynamic RRT*

arxiv: v2 [cs.ro] 15 Jul 2010

THE objective of this paper is to synthesize switching. Synthesis of Reactive Switching Protocols from Temporal Logic Specifications

Information-guided persistent monitoring under temporal logic constraints

Resilient Formal Synthesis

Runtime Model Predictive Verification on Embedded Platforms 1

Multi-layer Flight Control Synthesis and Analysis of a Small-scale UAV Helicopter

Unifying Behavior-Based Control Design and Hybrid Stability Theory

Chapter 3: Discrete Optimization Integer Programming

Physics-Aware Informative Coverage Planning for Autonomous Vehicles

Assume-guarantee Cooperative Satisfaction of Multi-agent Systems

Hierarchical Synthesis of Hybrid Controllers from Temporal Logic Specifications

From Abstract Models to Executable Models for Multi-Agent Path Finding on Real Robots

Design and modelling of an airship station holding controller for low cost satellite operations

Trajectory tracking & Path-following control

THE classical objectives considered in automatic control

A Decentralized Approach to Multi-agent Planning in the Presence of Constraints and Uncertainty

Automata-based Verification - III

Hover Control for Helicopter Using Neural Network-Based Model Reference Adaptive Controller

International ejournals

arxiv: v1 [cs.ro] 17 Mar 2014

Multi-Objective Planning with Multiple High Level Task Specifications

A Learning Based Approach to Control Synthesis of Markov Decision Processes for Linear Temporal Logic Specifications

Responsive Imaging Constellations for Support of Geographically Dispersed Theaters

Towards Reduced-Order Models for Online Motion Planning and Control of UAVs in the Presence of Wind

Nonlinear Control of a Quadrotor Micro-UAV using Feedback-Linearization

Multi-Robot Mixing Using Braids

Model Checking: An Introduction

Motion planning applications of Satisfiability Modulo Convex Optimization

UAV Navigation: Airborne Inertial SLAM

Abstractions and Decision Procedures for Effective Software Model Checking

1 Introduction. 2 Successive Convexification Algorithm

arxiv: v1 [cs.ro] 4 Oct 2016

An Active Learning Approach For Inferring Discrete Event Automata

Autonomous Mobile Robot Design

Path Planning for Time-Optimal Information Collection

Linear Time Logic Control of Discrete-Time Linear Systems

Transcription:

Provably Correct Persistent Surveillance for Unmanned Aerial Vehicles Subject to Charging Constraints Kevin Leahy, Dingjiang Zhou, Cristian-Ioan Vasile, Konstantinos Oikonomopoulos, Mac Schwager, and Calin Belta Boston University, Boston MA 0225 Motivation, Problem Statement, and Related Work In this paper, we investigate the automatic deployment of multiple quadrotors under resource constraints. The relatively short battery life in many unmanned aerial vehicles (UAVs) presents a significant barrier to their use in complex, long term surveillance missions. Moreover, the use of multiple vehicles allows for more complex behavior and longer mission horizons, but further complicates the task of deploying those vehicles given limited flight time. We present an algorithm that generates a feedback controller for multiple quadrotors with charging constraints to meet a complex temporal logic specification. The algorithm comprises a three-part tool chain that first plans a high level routing schedule for the quadrotors, then generates a vector field control input for the quadrotors to accomplish the schedule, and finally controls the quadrotors nonlinear dynamics to follow the vector field with a feedback controller. The performance of the complete system, with its three interacting parts, is investigated in 50 experimental runs using two quadrotors in a motion capture environment. As a motivating example, we consider the environment shown in Fig. consisting of three charging stations, three regions of interest, and two aerial vehicles. We assume vehicle battery life is 40 time units, and charging takes 20 time units, where time units are a generic unit that can be instantiated based on a particular implementation. Given this environment and these battery and charging constraints, we wish to satisfy the following mission specification infinitely often: within 6 time units observe Region R3 for at least 3 time units; within 28 time units, observe Region R for at least 2 time units; and within 46 time units, observe Region R2 for at least 2 time units then within 8 time units observe Region R or Region R3 for at least 2 time units. We seek a method to generate a control policy ensuring that vehicles can be automatically deployed to successfully complete this mission in the specified environment. Formally, the problem we seek to solve is: given an environment and a temporal logic mission specification with time deadlines that needs to be satisfied infinitely often, generate control policies for a team of quadrotors to complete the mission, while ensuring vehicles remain charged and collisions are avoided. The solution to this problem requires the use of several sophisticated systems, whose interaction both at a theoretical level and an experimental level produces many unique challenges. Our approach is related to the Vehicle Routing Problem

(VRP) [], which can be summarized as: given N identical vehicles at a depot and the distances among all sites and the depot, find a minimum distance tour for each vehicle such that it begins and ends at the depot and visits each site at least once. By placing time bounds on when each site must be visited, we obtain a version of VRP called Time Window VRP (VRPTW). Recent related work includes [2] where the authors propose a fragment of metric temporal logic, which restricts temporal operators to atomic propositions and their negation. Our approach has fewer restrictions, including allowing for a vehicle to visit a site multiple times during a tour if it is required. Multi-agent control for the VRPTW has also been considered without temporal logic constraints in [3][4]. 2 Technical Approach To capture the richness of the specification, we use bounded linear temporal logic (BLTL), a temporal logic with time bounds on each of its temporal operators. The mission specification presented above can be expressed as G,where is given in () as a BLTL formula and the G operator indicates that should be satisfied infinitely often. = F apple6 G 3 3 ^ F apple28 G 2 ^ F apple46 G 2 2 ^ F apple0 G 2 ( _ 3 ) () In (), ^ and _ are the usual Boolean operators indicating conjunction and disjunction, while F and G are the temporal operators eventually and always, respectively. Superscripts on the temporal operators are time bounds on those operators. Each i is the atomic proposition that is true when Region Ri is being observed. Flight through the environment generates a word over, the set of the atomic propositions, which can be checked against () for satisfaction of the specification. Finding a control policy that satisfies () necessitates representing the environment as a weighted transition system, where the charging stations and regions of interest are the states and the transitions are possible paths among the states. The weights on the transitions represent travel times along those paths. To generate the transition system, the environment is first partitioned into cubes. Paths between all pairs of regions of interest are found as sequences of these cubes. The paths are constrained such that quadrotors fly to a fixed height from the charging stations and perform all observations from that fixed altitude. From these paths, a vector field is generated to ensure each sequence of cubes is followed. The vector field everywhere inside a given cube is a convex combination of the vectors at its vertices [5]. Such a vector field can be used to keep the vehicle from leaving the cube (stay-in-cell) or to force it to leave through a given facet (control-to-facet), as displayed in Fig. 2. The weights on the transition system are the upper bounds on travel time along each sequence of cubes, which can be calculated using methods previously explored [6]. Hovering over a region or charging are modeled as self-loop transisitons of weight. Once the transition system is created, the charging and battery constraints are converted to finite state automata (FSA). A series of product automata are

created from the charging FSA; specification (), encoded as an FSA;and the transition system. Djikstra s algorithm is used on the final product automaton to find satisfying minimum loop-time control strategies, for which the vector fields have already been generated. Only mutually exclusive operation of quadrotors is considered in this framework and thus collision avoidance is conservatively guaranteed. Following the vector field is accomplished using a controller that exploits the fact that quadrotor dynamics are known to be di erentially flat [7]. The controller calculates a control input based on a flat output the quadrotor position and yaw angle and its derivatives, which can be calculated analytically from the vector field. 3 Results The partitioned environment (Figs. & 5) consists of 385 cubes each with edge length 0.36m. Control policies for G were calculated over the transition system displayed in Fig.. The computation time, excluding encoding of (), was 30.7 seconds, and the final product automaton had 579,54 nodes and 2,079,208 edges. No solutions were found for quadrotors starting on Chargers C2 and C3, but all other combinations of starting positions yielded solutions. C3 2 6 0 8 R2 5 3 C2 5 5 7 R3 5 5 R 0 3 7 3 C Fig.. Partitioned environment viewed from above and transition system. Green squares are charging stations, while blue squares are regions of interest. States in the transition system are charging stations and regions of interest. Weights on transitions are based on calculated time bounds. 4 Experiments Experiments were performed in the Boston University Aerial Robotics Arena. The Arena consists of a flight space with IR cameras to track reflective markers

.5 Z 0.5!0.5 0 0 0.5 0 0.5.5.5 X!0.5!0.5 0 0.5.5 Fig. 2. Vector field detail and quadrotor flight data displayed in vector field. The cube Y X at the top left shows a control-to-facet vector field, and the cube at the bottom left shows a stay-in-cell vector field. One of these two kinds of fields is given to the quadrotor in each cell along its path to guide it through the desired trajectory.!0.5 on the quadrotors using an OptiTrack system. This system allows for real-time localization of the quadrotors during experiments. Two K500 quadrotors from KMel robotics were used to execute the control policies described in Sec. 3. Charging stations (Fig. 3) were designed and built at Boston University for automatic docking and charging of quadrotors. These platforms allow one vehicle to land and another to take o, ensuring continuous monitoring in situations where one vehicle may not be able to satisfy a persistent monitoring mission specification on its own. The charging stations are made of laser cut acrylic parts connected with PLA plastic 3D printed parts. The electronics of the station consist of the Hyperion EOS0720i Net3AD charger, modified to enable control by MATLAB. The charging pads are 40 stainless steel. To secure a robust connection with the stainless steel pads of the charging station, the quadrotors are equipped with stainless steel contacts mounted on springs with magnets. The platform is entirely controlled by MATLAB via USB connection, allowing for the detection of the presence of a quadrotor, real-time monitoring of battery and charging status, and control of the charging parameters including battery type, capacity, and charging rate. The maximum charging rate that can be achieved is 8 Amperes. Figure 4 shows the results of a flight by two quadrotors. Seconds were used as the time units for these experiments so flights could be rapidly performed and analyzed. The flights consist of two loops that each satisfy. Since can be satisfied repeatedly, these flights can satisfy the mission specification, G. In the first loop, Quadrotor takes o first and Quadrotor 2 completes the loop, while in the second loop the flight order is reversed. The second loop of the flight resulted in the quadrotors landing on the same charging stations from which they took o, and any subsequent loops would be identical to this second loop.

Figure 4 shows that the specification was satisfied for both loops in the flight. Region R was visited in 5.8 seconds in Loop and 7.5 seconds in Loop 2, ahead of the 28 second deadline. Likewise, Region R3 was visited in 2.4 and 2.6 seconds ahead of the 6 second deadline. In the second portion of each loop, Region R2 was visited in 34.0 and 30.3 seconds with a deadline of 46 seconds, and Region R was visited within the 8 second deadline after each visit to Region R2. The two-loop flight described above was performed Fig. 3. Quadrotor resting on charging station. 50 times, and both quadrotors were consistent in their flight times. The standard deviation in the length of each portion of the flight time was on the order of 0.s. Despite this consistency, the time bound on flying from Charger C to Region R was violated by the second quadrotor in each flight, while not being violated by the first quadrotor. Fig. 4. Timeline of quadrotor flights for two loops. The first two rows display the first loop, with Quadrotor flying before Quadrotor 2. The next two rows show the second loop, with Quadrotor 2 flying first. 5 Main Experimental Insights The implementation of the persistent surveillance framework required three systems to be integrated together: the BLTL control policy algorithm, the vector field, and the quadrotor controller. Inevitably, the limitations to a theoretical framework appear at the interface of such systems. For example, the use of multiple vehicles required tuning the controllers quite di erently to ensure that the vector field was followed, even though the vehicles are of the same make and model. Regardless of any such complications, because a conservative approach was used, such as using upper bounds on travel time rather than expected travel time, the system met the specifications reliably and predictably. These experiments establish a framework that can be extended to larger groups of vehicles flying simultaneously, more complex distributed mission tasks, and longer experimental mission horizons.

Fig. 5. Screencaps of the first flight loop. The main insight gleaned from this experiment is how to automate a complex persistent surveillance mission specified as a temporal logic formula. The methodology explained herein allows for rapid experimentation following theoretic work using temporal logics. By using the environment partition and transition system generation with time bounds, minimal human input is required to establish an experimental framework for simulating and executing missions. Further, the inclusion of charging stations, whose performance can be modeled using automata, allows for long-term, truly persisent missions involving multiple vehicles not only to be modeled, but to actually be performed in the lab. References. G.B. Dantzig and J.H. Ramser. The truck dispatching problem. Management Science, 6():80 9,959. 2. S. Karaman and E. Frazzoli. Vehicle routing problem with metric temporal logic specifications. In Proc. of the IEEE Conference on Decision and Control (CDC08), pages 3953 3958. IEEE, 2008. 3. N. Michael, E. Stump, and K. Mohta. Persistent surveillance with a team of mavs. In Proc. of the International Conference on Intelligent Robots and Systems (IROS ), pages2708 274.IEEE,20. 4. E. Stump and N. Michael. Multi-robot persistent surveillance planning as a vehicle routing problem. In Proc. of the IEEE Conference on Automation Science and Engineering (CASE), pages569 575.IEEE,20. 5. C. Belta and L.C.G.J.M. Habets. Controlling a class of nonlinear systems on rectangles. IEEE Transactions on Automatic Control, 5():749 759,2006. 6. E. Aydin Gol and C. Belta. Time-constrained temporal logic control of multi-a ne systems. Nonlinear Analysis: Hybrid Systems, 0:2 33,203. 7. D. Zhou and M. Schwager. Vector field following for quadrotors using di erential flatness. In Proc. of the International Conference on Robotics and Automation (ICRA), May204.