Trajectory Smoothing as a Linear Optimal Control Problem

Similar documents
Homework Solution # 3

Extremal Collective Behavior

1 Controllability and Observability

5th-order differentiation

Motion Camouflage for Coverage

Gramians based model reduction for hybrid switched systems

Model reduction for linear systems by balancing

Topic # Feedback Control Systems

Deterministic Dynamic Programming

Three-Dimensional Motion Coordination in a Spatiotemporal Flowfield

Nonlinear Control Systems

Duality and dynamics in Hamilton-Jacobi theory for fully convex problems of control

Chap. 3. Controlled Systems, Controllability

New ideas in the non-equilibrium statistical physics and the micro approach to transportation flows

Robotics. Dynamics. Marc Toussaint U Stuttgart

An introduction to Mathematical Theory of Control

The Frenet Serret formulas

A new statistical framework to infer gene regulatory networks with hidden transcription factors

Event-triggered stabilization of linear systems under channel blackouts

Stationary trajectories, singular Hamiltonian systems and ill-posed Interconnection

II. Unit Speed Curves

Problem 1 Cost of an Infinite Horizon LQR

Infeasibility Detection and an Inexact Active-Set Method for Large-Scale Nonlinear Optimization

arxiv: v1 [cs.sy] 29 Sep 2017

Min-Max Output Integral Sliding Mode Control for Multiplant Linear Uncertain Systems

Project Optimal Control: Optimal Control with State Variable Inequality Constraints (SVIC)

6.241 Dynamic Systems and Control

Presenter: Noriyoshi Fukaya

State Space Representation of Gaussian Processes

Nonlinear Analysis 71 (2009) Contents lists available at ScienceDirect. Nonlinear Analysis. journal homepage:

13 Path Planning Cubic Path P 2 P 1. θ 2

Objective. 1 Specification/modeling of the controlled system. 2 Specification of a performance criterion

Tutorial on Control and State Constrained Optimal Control Pro. Control Problems and Applications Part 3 : Pure State Constraints

Subject: Optimal Control Assignment-1 (Related to Lecture notes 1-10)

Linear-Quadratic-Gaussian (LQG) Controllers and Kalman Filters

Theorema Egregium, Intrinsic Curvature

OHSx XM521 Multivariable Differential Calculus: Homework Solutions 14.1

OPTIMAL CONTROL OF A CLASS OF PARABOLIC

5.2 The Levi-Civita Connection on Surfaces. 1 Parallel transport of vector fields on a surface M

OPTIMAL CONTROL. Sadegh Bolouki. Lecture slides for ECE 515. University of Illinois, Urbana-Champaign. Fall S. Bolouki (UIUC) 1 / 28

Complex Laplacians and Applications in Multi-Agent Systems

Input to state Stability

CHAPTER 4 DIFFERENTIAL VECTOR CALCULUS

CALIFORNIA INSTITUTE OF TECHNOLOGY Control and Dynamical Systems. CDS 110b

16.31 Fall 2005 Lecture Presentation Mon 31-Oct-05 ver 1.1

MULTI-AGENT TRACKING OF A HIGH-DIMENSIONAL ACTIVE LEADER WITH SWITCHING TOPOLOGY

Linear-quadratic control problem with a linear term on semiinfinite interval: theory and applications

Feedback Optimal Control of Low-thrust Orbit Transfer in Central Gravity Field

Game Theory Extra Lecture 1 (BoB)

Probabilistic Graphical Models

Observability. It was the property in Lyapunov stability which allowed us to resolve that

9 Controller Discretization

Regularization of the Inverse Laplace Transform with Applications in Nuclear Magnetic Resonance Relaxometry Candidacy Exam

Alberto Bressan. Department of Mathematics, Penn State University

An introduction to Birkhoff normal form

An Extension of. Friedmann-Robertson-Walker Theory beyond Big Bang

Trajectory Tracking Control of Bimodal Piecewise Affine Systems

An Optimal Control Problem for Rigid Body Motions in Minkowski Space

Input-output finite-time stabilization for a class of hybrid systems

= m(0) + 4e 2 ( 3e 2 ) 2e 2, 1 (2k + k 2 ) dt. m(0) = u + R 1 B T P x 2 R dt. u + R 1 B T P y 2 R dt +

HIGHER ORDER SLIDING MODES AND ARBITRARY-ORDER EXACT ROBUST DIFFERENTIATION

Linear Quadratic Zero-Sum Two-Person Differential Games Pierre Bernhard June 15, 2013

Tangent and Normal Vectors

Stochastic Differential Equations.

Interpolated Rigid-Body Motions and Robotics

MASSACHUSETTS INSTITUTE OF TECHNOLOGY Department of Electrical Engineering and Computer Science : Dynamic Systems Spring 2011

e 2 = e 1 = e 3 = v 1 (v 2 v 3 ) = det(v 1, v 2, v 3 ).

Linear System Theory

TAM 212 Worksheet 3: Track design. Solutions

arxiv: v1 [math.pr] 18 Oct 2016

Tangent and Normal Vector - (11.5)

There is a function, the arc length function s(t) defined by s(t) = It follows that r(t) = p ( s(t) )

Spezielle Funktionen 4 Skalare Kugelfunktionen

EXAM 2 ANSWERS AND SOLUTIONS, MATH 233 WEDNESDAY, OCTOBER 18, 2000

Lecture 1 From Continuous-Time to Discrete-Time

Stability of Feedback Solutions for Infinite Horizon Noncooperative Differential Games

On feedback stabilizability of time-delay systems in Banach spaces

AC&ST AUTOMATIC CONTROL AND SYSTEM THEORY SYSTEMS AND MODELS. Claudio Melchiorri

On the stochastic nonlinear Schrödinger equation

Adaptive Piecewise Polynomial Estimation via Trend Filtering

On Solvability of an Indefinite Riccati Equation

arxiv: v3 [math.ds] 22 Feb 2012

Brownian Motion. 1 Definition Brownian Motion Wiener measure... 3

Pythagorean-hodograph curves

Continuous Curvature Path Generation Based on Bézier Curves for Autonomous Vehicles

Limit solutions for control systems

MA 351 Fall 2007 Exam #1 Review Solutions 1

Nonlinear Observers. Jaime A. Moreno. Eléctrica y Computación Instituto de Ingeniería Universidad Nacional Autónoma de México

Quadratic estimates and perturbations of Dirac type operators on doubling measure metric spaces


On Irregular Linear Quadratic Control: Deterministic Case

DEPARTMENT OF ELECTRICAL AND ELECTRONIC ENGINEERING EXAMINATIONS 2010

Optimal Linear Feedback Control for Incompressible Fluid Flow

Lecture Note 13:Continuous Time Switched Optimal Control: Embedding Principle and Numerical Algorithms

Kalman Filter and Parameter Identification. Florian Herzog

An efficient approach to stochastic optimal control. Bert Kappen SNN Radboud University Nijmegen the Netherlands

INERTIALIZATION: HIGH-PERFORMANCE ANIMATION TRANSITIONS IN GEARS OF WAR DAVID BOLLO PRINCIPAL SOFTWARE ENGINEER THE COALITION (MICROSOFT STUDIOS)

Time-Invariant Linear Quadratic Regulators!

Reachability and Controllability

Cubic spline collocation for a class of weakly singular Fredholm integral equations and corresponding eigenvalue problem

Transcription:

Trajectory Smoothing as a Linear Optimal Control Problem Biswadip Dey & P. S. Krishnaprasad Allerton Conference - 212 Monticello, IL, USA. October 4, 212

Background and Motivation 1 Background and Motivation 2 Problem Generative Model Inverse Problem Relationship between Linear and Non-linear Generative Models 3 Optimal Control Based Approach for Trajectory Reconstruction Path Independence Lemma Existence of Optimal Initial Condition Optimal Reconstruction as a Linear Smoother Co-State Based Approach 4 Cross-validation Approach to Inverse Problem 5 Numerical Results

Background and Motivation Background and Motivation To explore underlying strategies and motion (pursuit, collective motion etc.) governing control laws, by extracting parameters of motion (namely curvature, speed, lateral acceleration etc.) from sampled observations of trajectories. To extract control inputs from sampled data.

Problem 1 Background and Motivation 2 Problem Generative Model Inverse Problem Relationship between Linear and Non-linear Generative Models 3 Optimal Control Based Approach for Trajectory Reconstruction Path Independence Lemma Existence of Optimal Initial Condition Optimal Reconstruction as a Linear Smoother Co-State Based Approach 4 Cross-validation Approach to Inverse Problem 5 Numerical Results

Problem Generative Model Generative Models for a Curve inr 3 (Non-linear and Linear) Natural Frenet Frame ṙ = νt T = ν(k 1 M 1 +k 2 M 2 ) Ṁ 1 = νk 1 T Ṁ 2 = νk 2 T Trajectory T r M2 M1 (1) The natural curvatures are the steering inputs and the speed is a time function dictated by propulsive/lift/drag mechanisms. Linear Generative Model ṙ = v v = a ȧ = u (2) Jerk, i.e. the third-derivative of position, is viewed as the control. LTI representation ẋ = Ax+Bu r = Cx with, x = [ r T v T a T ] T ; A = I I ;B = I C = [ I ] ; (3) Controllable and Observable

Problem Inverse Problem Regularized Inverse Problem T M2 : Data Points r M1 Trajectory Given a time series of observed positions, generate a smooth trajectory to fit the data points. The inverse problem is ill-posed. Highly sensitive to noise. Non-unique. A regularization parameter is introduced to control the amount of smoothing. Ordinary cross validation is a standard approach to choose an optimal value for the regularization parameter.

Problem Inverse Problem Extracting Curvature (Inverse Problem) Non-linear Optimization Minimize Observations r i Regularized Inversion States, Controls r, [T,M 1,M 2],k 1,k 2,ν ( N ) T r(t i ) r i 2 +λ ( k 1 2 + k 2 2 + ν 2 )dt i= subject to Dynamics in (1), Initial Condition, and Input (4) Linear-Quadratic Control Minimize Observations r i ( N i= Regularized Inversion States, Controls r,v,a,u ) T r(t i ) r i 2 +λ u T udt subject to Dynamics in (3), Initial Condition, and Input (5)

Problem Relationship between Linear and Non-linear Generative Models Relationship between Two Approaches for Modelling a Curve Natural-Frenet Frame Linear Model (Triple Integrator) v = νt a = νt +ν 2 k 1 M 1 +ν 2 k 2 M 2 u = ( ν ν 3 (k1 2 +k2 2 ))T +(3ν νk 1 +ν 2 k1 )M 1 +(3ν νk 2 +ν 2 k2 )M 2 Linear Model (Triple Integrator) Natural-Frenet Frame ν = v T = v v T = 1 ν (a (a T)T) κ = T ν τ = v (a u) v a 2 k 1,k 2,M 1,M 2 can be computed by assuming suitable intial conditions. ( t ) k 1 (t) = κcos θ + τ(σ)dσ ( k 2 (t) = κsin θ + t ) τ(σ)dσ t M 1 (t) = M 1 () ν(σ)k 1 (σ)t(σ)dσ t M 2 (t) = M 2 () ν(σ)k 2 (σ)t(σ)dσ

Optimal Control Based Approach for Trajectory Reconstruction 1 Background and Motivation 2 Problem Generative Model Inverse Problem Relationship between Linear and Non-linear Generative Models 3 Optimal Control Based Approach for Trajectory Reconstruction Path Independence Lemma Existence of Optimal Initial Condition Optimal Reconstruction as a Linear Smoother Co-State Based Approach 4 Cross-validation Approach to Inverse Problem 5 Numerical Results

Optimal Control Based Approach for Trajectory Reconstruction Path Independence Lemma Application of Path Independence Lemma Optimal Control Problem: Minimize x(t ),u J(x(t ),u) = subject to x(t ) R n, u U, N i= Dynamics in (3) T r(t i ) r i 2 +λ u T udt (6) Path Independence: Along trajectories of (3) = x T (t i )K(t + i )x(t i) x T (t i+1 )K(t i+1 )x(t i+1) t [ ] T [ i+1 x K +A + T ][ K +KA KB x u B T K u t + i = x T (t i )η(t + i ) xt (t i+1 )η(t i+1 )+ t t + i i+1 [ x u ] dt ] T [ η +A T η B T η ] dt for all i {,1,,N 1}

Optimal Control Based Approach for Trajectory Reconstruction Path Independence Lemma Application of Path Independence Lemma Assumptions on the the dynamics and boundary values of K and η: K = A T K KA+KBB T K, K(t + N ) =, K(t + i ) K(t i ) = 1 λ CT C. (7) With the assumptions (7) and (8), we obtain η = ( A T KBB T) η, η(t + N ) =, η(t + i ) η(t i ) = 2 λ CT r i. ] N J(x(t ),u) = λ [x T (t )K(t )x(t )+x T (t )η(t ) + ri T r i 1 T 4 λ B T η(t) 2 dt i= T ( +λ u(t) +B T K(t)x(t)+ 1 ) 2 η(t) 2 dt. (9) Optimal control input: u opt (t) = B T ( K(t)x(t)+ 1 2 η(t) ) (1) Optimal initial condition: ] [K(t ) x opt (t )+ 1 2 η(t ) =. (11) (8)

Optimal Control Based Approach for Trajectory Reconstruction Existence of Optimal Initial Condition Existence of Solution for (11) - Sketch of Proof Proposition 1 The solution of the Riccati equation (7) assumes the form K(t i ) = 1 N Φ Σ (t i,t k )C T CΦ T Σ λ (t i,t k ) k=i for any i {,1,,N} where Σ(t) = (A 1 2 BBT K(t)) T and Φ Σ is the transition matrix of Σ. Holds true for i = N. Apply mathematical induction. Proposition 2 ( Σ T,C) forms an observable pair for the problem of our interest (3). Apply Silverman-Meadows rank condition.

Optimal Control Based Approach for Trajectory Reconstruction Existence of Optimal Initial Condition Existence of Solution for (11) - Sketch of Proof Theorem 1 The equation [K(t ) ] x opt (t )+ 1 2 η(t ) =. is uniquely solvable for almost any time index set {t i } N i=. Observe K(t ) can be represented as K(t ) = 1 λ CT C, with C CΦ Σ T(t 1,t ) C =.. CΦ Σ T(t N,t ) Consider the system ξ = Σ T ξ; γ = Cξ. The outputs, corresponding to two different initial conditions, do not match identically over any interval. ξ a ξ b Cξ a Cξ b (almost surely) Otherwise, consider an arbitrary close perturbation of the original time index set {t i } N i=, to obtain full rank for C.

Optimal Control Based Approach for Trajectory Reconstruction Optimal Reconstruction as a Linear Smoother Linearity in the Reconstructed Trajectory Closed loop dynamics: ẋ(t) = Σ T x(t) 1 2 BBT η(t) with Σ = [ A BB T K(t) ] T. x opt(t ) and η( ) are linear in observed data {r i } N i=. where r(t k ) = 1 N [ CF λ λ (k,i)c T] r i (12) i= F λ (k,i) = Φ T Σ(t [ ] 1Φ Σ(t,t k ) K(t ),t i ) + min{i,k} j=1 ( tj t j 1 Φ T Σ(σ,t k )BB T Φ Σ(σ,t i )dσ ) Can be be viewed as a global alternative to Savitzky-Golay smoothing filters. Can be used as a building block to obtain a fixed lag smoothing algorithm.

Optimal Control Based Approach for Trajectory Reconstruction Co-State Based Approach An Alternative Co-State Based Approach Co-state variables: p(t) K(t)x(t)+ 1 2 η(t) An optimal trajectory between two observation times can be viewed as the base integral curve of the following Hamiltonian dynamics [ ] [ ][ ] d x(t) A BB T x(t) = dt p(t) A T p(t) Jump condition for the co-state variables: p(t + i ) p(t i ) = 1 λ CT (r i r(t i )) Terminal condition for the co-state variables: p(t + N ) = p(t ) =

Optimal Control Based Approach for Trajectory Reconstruction Co-State Based Approach An Alternative Co-State Based Approach Forward-propagation of x(t i ) and p(t + i ): [ ] [ x(ti+1 ) e A i e A iw i p(t + i+1 ) = ] 1 λ CT Ce A i [e AT i + 1λ CT Ce A iw i [ ] + 1 r i+1 λ CT where W i is defined as ] [ x(ti ) p(t + i ) ] W i = i e Aσ BB T e ATσ dσ ( i = t i+1 t i ) Optimal initial condition is obtained by solving ( N 1 ) [ ] I N N 1 [ I] Λ i 1 x(t i= λ CT C ) = [ I] Λ j Γr i (13) i= j=i where, [ Λ i = e A i 1 λ CT Ce A i e A iw i ] [e AT i + 1 λ CT Ce A iw i ] [ ;Γ = 1 λ CT ]

Cross-validation Approach to Inverse Problem 1 Background and Motivation 2 Problem Generative Model Inverse Problem Relationship between Linear and Non-linear Generative Models 3 Optimal Control Based Approach for Trajectory Reconstruction Path Independence Lemma Existence of Optimal Initial Condition Optimal Reconstruction as a Linear Smoother Co-State Based Approach 4 Cross-validation Approach to Inverse Problem 5 Numerical Results

Cross-validation Approach to Inverse Problem Cross-validation Approach to Determination of Penalty Parameter We use leaving-out-one version of the Ordinary Cross Validation (OCV) technique. Let, {x [λ,k] opt,u[λ,k] } be a minimizer of: N T r(t i ) r i 2 +λ u T udt i= i k Let the reconstructed trajectory be r [λ,k] ( ). Then the OCV cost is defined as: V (λ) = 1 N +1 N r [λ,k] (t k ) r k 2 k= Hence, OCV estimate for λ is defined as: λ = argmin V (λ) λ R +

Numerical Results 1 Background and Motivation 2 Problem Generative Model Inverse Problem Relationship between Linear and Non-linear Generative Models 3 Optimal Control Based Approach for Trajectory Reconstruction Path Independence Lemma Existence of Optimal Initial Condition Optimal Reconstruction as a Linear Smoother Co-State Based Approach 4 Cross-validation Approach to Inverse Problem 5 Numerical Results

Numerical Results Numerical Result - Spherical Curve Avg. Fit Error/Radius: 13.686 1 3.

Numerical Results Numerical Result - Circular Helix Avg. Fit Error/Radius: 12.346 1 3.

References References E. Justh and P. S. Krishnaprasad, Optimal Natural Frames, Comm. Inf. Syst., 11(1):17-34, 211. T. Flash and N. Hogan, The coordination of arm movements: An experimentally confirmed mathematical model, The Journal of Neuroscience, 5(7):1688-173, 1985. R. L. Bishop, There is more than one way to frame a curve, The American Mathematical Monthly, 82(3):246-251, 1975. L. M. Silverman and H. E. Meadows, Controllability and observability in time-variable linear systems, SIAM J. on Control and Optimization, 5(1):64-73, 1967. P.V. Reddy, Steering laws for pursuit, M. S. Thesis, University of Maryland, College Park, 27. B. Dey and P. S. Krishnaprasad, Trajectory smoothing as a linear optimal control problem (extended version), http://www.isr.umd.edu/labs/isl/smoothing.

???