Optimization of a point-mass walking model using direct collocation and sequential quadratic programming

Similar documents
Switching Time Optimization in Discretized Hybrid Dynamical Systems

Chapter 2 Lagrangian Modeling

Numerical Integrator. Graphics

u t v t v t c a u t b a v t u t v t b a

Lecture 2 Lagrangian formulation of classical mechanics Mechanics

Table of Common Derivatives By David Abraham

Energy behaviour of the Boris method for charged-particle dynamics

Examining Geometric Integration for Propagating Orbit Trajectories with Non-Conservative Forcing

Diagonalization of Matrices Dr. E. Jacobs

Math Notes on differentials, the Chain Rule, gradients, directional derivative, and normal vectors

Lecture XII. where Φ is called the potential function. Let us introduce spherical coordinates defined through the relations

6 General properties of an autonomous system of two first order ODE

ON THE OPTIMALITY SYSTEM FOR A 1 D EULER FLOW PROBLEM

Mechatronics by Analogy and Application to Legged Locomotion

Ordinary Differential Equations: Homework 1

Text S1: Simulation models and detailed method for early warning signal calculation

Lagrangian and Hamiltonian Mechanics

Math 11 Fall 2016 Section 1 Monday, September 19, Definition: A vector parametric equation for the line parallel to vector v = x v, y v, z v

Calculus of Variations

The Principle of Least Action

Assignment 1. g i (x 1,..., x n ) dx i = 0. i=1

A Novel Decoupled Iterative Method for Deep-Submicron MOSFET RF Circuit Simulation

ECE 422 Power System Operations & Planning 7 Transient Stability

Adjoint Transient Sensitivity Analysis in Circuit Simulation

Lagrangian and Hamiltonian Dynamics

Topic 2.3: The Geometry of Derivatives of Vector Functions

Semi passive walking of a 7 DOF biped robot ABSTRACT 1 INTRODUCTION

Minimum-time constrained velocity planning

LATTICE-BASED D-OPTIMUM DESIGN FOR FOURIER REGRESSION

Angles-Only Orbit Determination Copyright 2006 Michel Santos Page 1

BEYOND THE CONSTRUCTION OF OPTIMAL SWITCHING SURFACES FOR AUTONOMOUS HYBRID SYSTEMS. Mauro Boccadoro Magnus Egerstedt Paolo Valigi Yorai Wardi

where: q = [q 1 ] T : vector of generalized coordinates and q and q are vectors of joint angular velocity and acceleration.

θ x = f ( x,t) could be written as

Separation of Variables

Thermal conductivity of graded composites: Numerical simulations and an effective medium approximation

ELEC3114 Control Systems 1

Strength Analysis of CFRP Composite Material Considering Multiple Fracture Modes

19 Eigenvalues, Eigenvectors, Ordinary Differential Equations, and Control

2.25 m. (a) Using Newton s laws of motion, explain why the student can gain an initial speed to leave the ground vertically.

Chapter 9 Method of Weighted Residuals

Statics. There are four fundamental quantities which occur in mechanics:

ORDINARY DIFFERENTIAL EQUATIONS AND SINGULAR INTEGRALS. Gianluca Crippa

there is no special reason why the value of y should be fixed at y = 0.3. Any y such that

Introduction to the Vlasov-Poisson system

Math 342 Partial Differential Equations «Viktor Grigoryan

The dynamics of the simple pendulum

EVALUATING HIGHER DERIVATIVE TENSORS BY FORWARD PROPAGATION OF UNIVARIATE TAYLOR SERIES

Mechanical energy transfer by internal force during the swing phase of running

Experimental Robustness Study of a Second-Order Sliding Mode Controller

'HVLJQ &RQVLGHUDWLRQ LQ 0DWHULDO 6HOHFWLRQ 'HVLJQ 6HQVLWLYLW\,1752'8&7,21

arxiv: v1 [math.oc] 5 Oct 2016

Dynamical Systems and a Brief Introduction to Ergodic Theory

Average value of position for the anharmonic oscillator: Classical versus quantum results

From Local to Global Control

Transmission Line Matrix (TLM) network analogues of reversible trapping processes Part B: scaling and consistency

A Course in Machine Learning

Goldstein Chapter 1 Exercises

Approximate reduction of dynamic systems

A Note on Exact Solutions to Linear Differential Equations by the Matrix Exponential

model considered before, but the prey obey logistic growth in the absence of predators. In

and from it produce the action integral whose variation we set to zero:

Real-time economic optimization for a fermentation process using Model Predictive Control

The Exact Form and General Integrating Factors

Role of parameters in the stochastic dynamics of a stick-slip oscillator

Nonlinear Adaptive Ship Course Tracking Control Based on Backstepping and Nussbaum Gain

State-Space Model for a Multi-Machine System

WUCHEN LI AND STANLEY OSHER

2-7. Fitting a Model to Data I. A Model of Direct Variation. Lesson. Mental Math

APPPHYS 217 Thursday 8 April 2010

ACCELERATION, FORCE, MOMENTUM, ENERGY : solutions to higher level questions

The Hamiltonian particle-mesh method for the spherical shallow water equations

New Simple Controller Tuning Rules for Integrating and Stable or Unstable First Order plus Dead-Time Processes

Optimized Schwarz Methods with the Yin-Yang Grid for Shallow Water Equations

More from Lesson 6 The Limit Definition of the Derivative and Rules for Finding Derivatives.

APPROXIMATE SOLUTION FOR TRANSIENT HEAT TRANSFER IN STATIC TURBULENT HE II. B. Baudouy. CEA/Saclay, DSM/DAPNIA/STCM Gif-sur-Yvette Cedex, France

MARKO NEDELJKOV, DANIJELA RAJTER-ĆIRIĆ

TRAJECTORY TRACKING FOR FULLY ACTUATED MECHANICAL SYSTEMS

Reactive Power Compensation in Mechanical Systems

Exponential Tracking Control of Nonlinear Systems with Actuator Nonlinearity

NOTES ON EULER-BOOLE SUMMATION (1) f (l 1) (n) f (l 1) (m) + ( 1)k 1 k! B k (y) f (k) (y) dy,

Dusty Plasma Void Dynamics in Unmoving and Moving Flows

Proof of SPNs as Mixture of Trees

ensembles When working with density operators, we can use this connection to define a generalized Bloch vector: v x Tr x, v y Tr y

Further Differentiation and Applications

Calculus I Practice Test Problems for Chapter 3 Page 1 of 9

Applications of First Order Equations

12.5. Differentiation of vectors. Introduction. Prerequisites. Learning Outcomes

Optimal Variable-Structure Control Tracking of Spacecraft Maneuvers

The Principle of Least Action and Designing Fiber Optics

Monopoly Part III: Multiple Local Equilibria and Adaptive Search

Stabilization of a Class of Underactuated Mechanical Systems via Interconnection and Damping Assignment

Time Optimal Flight of a Hexacopter

LQG FLUTTER CONTROL OF WIND TUNNEL MODEL USING PIEZO-CERAMIC ACTUATOR

221A Lecture Notes Notes on Classica Mechanics I

x f(x) x f(x) approaching 1 approaching 0.5 approaching 1 approaching 0.

Efficient Macro-Micro Scale Coupled Modeling of Batteries

Dot trajectories in the superposition of random screens: analysis and synthesis

Predictive Control of a Laboratory Time Delay Process Experiment

An Introduction to Event-triggered and Self-triggered Control

Explicit Formulas for General Integrals of Motion for a Class of Mechanical Systems Subject To Virtual Constraints

Transcription:

Optimization of a point-mass walking moel using irect collocation an sequential quaratic programming Chris Dembia June 5, 5

Telescoping actuator y Stance leg Point-mass boy m (x,y) Swing leg x Leg uring stance F(t) F(t) l(t) F(t) mg Boy uring stance mg Boy uring flight Figure : The point mass moel of walking from [SR6]. a) Some possible gaits b) Inverte penulum walk c) Impulsive run ) Hybri intermeiate gait: penular run Stance Flight Inverte penulum Bounce Inverte penulum Flight Flight l max Bounce lmax Figure : (a) Possible perioic gaits for the point mass moel. (b-) Optimal gaits for the point mass moel, epening on the prescribe spee an step step length of the gait [SR6]. Introuction It is possible to uncover some funamental principles of human locomotion with a moel that consists of only a point mass an a massless telescoping (compressible) leg. Srinivasan an Ruina foun that such a moel chooses to walk if aske to move at slow spees, an to run if aske to move at high spees [SR6]. The moel they use is shown in Figure. The moel interacts with its environment via a telescoping linear actuator (its leg). The motion consists of two phases: a stance phase, uring which the stance leg is in contact with the groun; an a swing phase uring which the point mass is in free flight. Specifically, their moel chose between three istinct gaits, epening on the spee an step length of the gait: an inverte penulum walk, an impulsive run, an a penular run; these are shown in Figure. Srinivasan an Ruina foun these gaits by solving an optimal control problem in which the objective was to maximize the energy efficiency of the gait. The parameters in the optimization consiste of the initial state of the moel an the force applie by the actuator throughout the stance phase. To evaluate the objective function, they performe a forwar simulation (integration) of the moel s equations of motion, starting from the initial state. Their metho is calle single shooting; the optimizer chooses an initial state an shoots to the final state by integrating. This metho is typically challenging for optimizers, since the motion of a nonlinear ynamical system can be very sensitive to the initial state. Another

approach is to iscretize the state of the moel through time, an to enforce the ynamics at these time points through explicit constraints in the optimization problem. This metho is calle irect collocation [Bet]. When formulate via irect collocation, optimal control problems are far less sensitive to variations in the parameters. Direct collocation problems have many more parameters than single shooting problems, but are extremely sparse. In this work, we replicate the work of Srinivasan an Ruina using irect collocation instea of single shooting. We also solve the problem with a suite of ifferent cost functions, an explore the effect of these ifferent costs on the resulting motion. The problem Here, we escribe the optimal control problem that Srinivasan an Ruina [SR6] solve. The variables consist of the moel s state s(t) = (x(t), y(t), ẋ(t), ẏ(t)), where (x(t), y(t)) is the position of the point mass; the force applie to the groun by the leg F (t); an the uration of the stance phase t s. The step length, spee v, an step uration t step = /v are specifie. The leg length l(t) = x (t) + y (t) has a maximum value l max uring the stance phase, an the leg also has a finite strength. The motion must also be perioic. Formally, the optimal control problem is minimize C = tstep E(s(t), F (t)) t mg cost subject to mẍ = F x/l t t s stance ynamics mÿ = F y/l mg t t s stance ynamics mẍ = t s < t t step swing ynamics mÿ = mg t s < t t step swing ynamics l(t) l max t t s leg length F (t) t t s leg strength < t s t step stance uration x(t step ) x() = istance travele y() y(t step ) ẋ() = ẋ(t step ). perioicity ẏ() ẏ(t step ) () The point mass has mass m, an g is the acceleration ue to Earth s gravity on Earth s surface. For the integran cost E, Srinivasan an Ruina use the positive work performe by the leg, E = [F (t) l(t)] + ; with this E, then C is the cost of transport, which is a imensionless measure of the energetic cost of traveling a certain istance. In their work, the ynamics constraints were enforce via forwar integration. In this work, we use m = 5kg, g = 9.8m/s, l max =.5m, = N. Different gaits are foun by varying the nonimensional step length D = /l max an spee V = v/ gl max. 3

. Methos Direct collocation We iscretize Problem into T step mesh points t, t,..., t Tstep = t step. The transition from stance to swing occurs at time inex i = T s ; that is, t s = t Ts T step. In orer to express the secon orer ynamics of the system using first orer ifferential equations, we introuce the variables w = ẋ an z = ẏ for the horizontal an vertical spee of the point mass. The iscretization yiels state variables s, s,..., s Tstep (with s i = (x i, y i, w i, z i )) an leg forces F, F,..., F Ts. We enforce the ifferential constraints using a numerical metho for solving orinary ifferential equations, namely the implicit Euler metho: for a ifferential equation ẏ = f(y, t), we obtain y via y i+ = y i +h i f(y i+, t i+ ). We use an implicit integrator because they are unconitionally stable (the solution won t blow up ). The resulting finite-imensional optimization problem is minimize C = T h i [E(s i+, F i+ ) + E(s i, F i )] i= subject to x i+ x i = h i w i+ i =,,..., T step y i+ yi = h i z i+ i =,,..., T step x i+ F i+ w i+ w i = h i i =,,..., T s ( m l i+ ) Fi+ y i+ z i+ z i = h i g i =,,..., T s m l i+ w i+ w i = i = T s,..., T step z i+ z i = h i g i = T s,..., T step li lmax i =,,..., T s F i i =,,..., T s < t s t step x Tstep x = y y Tstep w = w Tstep. z z Tstep () The scalars h i = t i+ t i are the integration step sizes. The performance cost is now approximate by the iscretize C, which is compute using the trapezoial rule. This problem has a total n = 4T step + T s + variables (4 states, the leg force uring stance, an t s ) an 4(T step ) ynamics constraints. For all results shown in this report, we use T s =, T step =. The value of T step is chosen to be sufficiently large so that the error in the ODE solution is below some tolerance, typically 3. For all of the constraints, the parameters at only one or two of the mesh points is present; therefore, this problem is very sparse. Since E may be non-convex, an since the thir an fourth constraints in Problem are non-affine, this problem is not convex. 4

Sequential quaratic programming We solve Problem using sequential quaratic programming [GMS94]. At each iteration k of the metho, we approximate the problem as a quaratic program about our current iterate q (k). The solution of this QP, q (k), is use to compute a search irection q (k) = q (k) q (k) along which to perform a line search. The result of this line search is the next iterate q (k+). Within the QP, the quaratic objective is q C (k) (q q(k) ) + (q q(k) ) qc (k) (q q(k) ), where q C (k) an qc (k) are the graient an Hessian evaluate at q (k), respectively. The only constraints that must be approximate for the QP are the thir an fourth from Problem. We efine the following: x i+ F i+ a i = w i+ w i h i, ( m l i+ Fi+ y i+ = z i+ z i h i b i m u i = (w i+, w i, x i+, y i+, F i+ ), v i = (z i+, z i, x i+, y i+, F i+ ). ) g, l i+ The quantities a i an b i are the resiuals in thir an fourth constraints, an u i an v i are the parameters containe in the thir an fourth constraints. The affine approximations to these constraints are a (k) i + ( ui a (k) i ) T (u i u (k) i ) = b (k) i + ( vi b (k) i ) T (v i v (k) i ) =. The graients in these equations are [ ( ( ui a i ) T F i+ x = h i+ i ) F i+ x i+ y i+ h m li+ 3 i [ l i+ ( m li+ 3 ( vi b i ) T F i+ x i+ y i+ F i+ y = h i h i+ m li+ 3 i ) m li+ 3 l i+ h i m h i m x i+ l i+ To solve the SQP, we use the SNOPT package [GMS5]. SNOPT is a package for solving large sparse non-convex problems, an approximates the Hessian using a BFGS upate. For the problems we explore, SNOPT requires between 5 to 35 iterations to converge; on a moern personal computer, this takes about minute. During the first few outer iterations, the QP solver takes potentially hunres of steps. After about 5 outer iterations, when the iterate is nearly feasible, the QP usually only requires two steps. Transcribing the problem The proceure of converting Problem into Problem is calle transcription. We use the PSOPT optimal control package to perform this transcription for us [Bec]. PSOPT also iteratively solves the problem on larger meshes until an ODE error tolerance is satisfie. 5 y i+ l i+ ] ].

b) Penular walk c) Impulsive run ) Penular run Leg-force F Leg-length L Inverte penulum e e e e τ step τ τ s τ step τ τ step τ Bounce Inverte Flight penulum Flight F' max F' max F' max τ τ τ τ s 3 Results Figure 3: Leg length an leg force for the three gaits prouce by [SR6]. Reproucing the results of Srinivasan an Ruina, 6 By varying the values of the non-imensional step length D an non-imensional spee V, Srinivasan an Ruina iscovere three istinct gaits (Figure ). Figure 3 shows the leg force F (t) an leg length l(t) for a specific instance of each of these three gaits. The penular walk emerges from small D an small V, the impulsive run emerges for all D (> ) with V above, an the hybri penular run emerges from D above for V <. We reprouce these results qualitatively using the methos presente in section.; the three gaits are shown in Figure 4. The force F is not smooth as a result of the non-smooth cost E. Notably, the penular walk oes not contain a swing phase, an the force for both penular gaits is U-shape. Exploring ifferent cost functions We explore three ifferent cost functions an analyze their effect on the resulting walking solution, similar to work one by [Sri]. For all three, we use D =.5 an V =.5. Minimize force over the gait cycle. With E = F (t), the solution minimizes the amount of force use throughout the whole motion. The solution shows that this causes the introuction of a swing phase that lasts as long as the stance phase. Minimize square leg force. The cost E = F (t) penalizes large forces. This objective might capture the esire to reuce joint loaing for those with joint pain. With this gait, the moel oes not change its height at all, an the leg force remains close to only what is necessary to counteract gravity; F (t) mg. Minimize gross work. The cost E = F (t) l(t) essentially allows the moel to propel itself using the energy absorbe from breaking. Inee, with this cost function, the net work over the gait is. The resulting motion is spring-like. 6

E =[F L] +, D =.5, V =.5 E =[F L] +, D =.5, V =.5 E =[F L] +, D =.5, V =.5.8 t step.8 t s t step.8 t step t step t s t step t step Figure 4: Three istinct gaits (left to right): penular walk, impulsive run, penular run. E =F, D =.5, V =.5 E =F, D =.5, V =.5 E =F L, D =.5, V =.5.8 t step.8 t step.6 t step t step t step t step Figure 5: Walking with ifferent costs (left to right): leg force, square leg force, gross work. 7

References [Bec] [Bet] V. M. Becerra. Solving complex optimal control problems at no cost with PSOPT. Proc. IEEE Multi-conference on Systems an Control, Yokohama, Japan, pages 39 396,. John Betts. Practical Methos for Optimal Control an Estimation using Nonlinear Programming. Society for Inustrial an Applie Mathematics, secon eition,. [GMS94] Philip E Gill, Walter Murray, an Michael A Sauners. Large-scale sqp methos an their application in trajectory optimization. In Computational optimal control, pages 9 4. Springer, 994. [GMS5] Philip E. Gill, Walter Murray, an Michael A. Sauners. Snopt: An SQP Algorithm For Large-Scale Constraine Optimization. SIAM Review, 47():99 3, 5. [SR6] [Sri] Manoj Srinivasan an Any Ruina. Computer optimization of a minimal bipe moel iscovers walking an running. Nature, 439(77):7 75, 6. Manoj Srinivasan. Fifteen observations on the structure of energy-minimizing gaits in many simple bipe moels. Journal of The Royal Society Interface, 8(54):74 98,. 8