A Trust-region-based Sequential Quadratic Programming Algorithm

Similar documents
Algorithms for Constrained Optimization

Accelerating interior point methods with GPUs for smart grid systems

Benchmarking of optimization methods for topology optimization problems

Single Shooting and ESDIRK Methods for adjoint-based optimization of an oil reservoir

5 Handling Constraints

MS&E 318 (CME 338) Large-Scale Numerical Optimization

An Inexact Sequential Quadratic Optimization Method for Nonlinear Optimization

Kommerciel arbejde med WAsP

A Note on Powers in Finite Fields

Modelling Salmonella transfer during grinding of meat

Infeasibility Detection and an Inexact Active-Set Method for Large-Scale Nonlinear Optimization

An Equivalent Source Method for Modelling the Lithospheric Magnetic Field Using Satellite and Airborne Magnetic Data

AM 205: lecture 19. Last time: Conditions for optimality Today: Newton s method for optimization, survey of optimization methods

1 Computing with constraints

Validation of Boundary Layer Winds from WRF Mesoscale Forecasts over Denmark

An Inexact Newton Method for Nonlinear Constrained Optimization

AM 205: lecture 19. Last time: Conditions for optimality, Newton s method for optimization Today: survey of optimization methods

Prediction of airfoil performance at high Reynolds numbers.

Optimizing Reliability using BECAS - an Open-Source Cross Section Analysis Tool

Radioactivity in the Risø District July-December 2013

Constrained Optimization and Lagrangian Duality

Validation and comparison of numerical wind atlas methods: the South African example

The dipole moment of a wall-charged void in a bulk dielectric

An Inexact Newton Method for Optimization

Generating the optimal magnetic field for magnetic refrigeration

Design of Robust AMB Controllers for Rotors Subjected to Varying and Uncertain Seal Forces

Convex Optimization & Lagrange Duality

Determination of Feasible Directions by Successive Quadratic Programming and Zoutendijk Algorithms: A Comparative Study

Reducing Uncertainty of Near-shore wind resource Estimates (RUNE) using wind lidars and mesoscale models

Radioactivity in the Risø District January-June 2014

Numerical modeling of the conduction and radiation heating in precision glass moulding

The colpitts oscillator family

A New Penalty-SQP Method

Higher-order spectral modelling of the diffraction force around a vertical circular cylinder

Room Allocation Optimisation at the Technical University of Denmark

arxiv: v1 [math.oc] 10 Apr 2017

Determination of the minimum size of a statistical representative volume element from a fibre-reinforced composite based on point pattern statistics

Dynamic Effects of Diabatization in Distillation Columns

Numerical Optimization Professor Horst Cerjak, Horst Bischof, Thomas Pock Mat Vis-Gra SS09

5 Quasi-Newton Methods

Numerical Methods for PDE-Constrained Optimization

Lidar calibration What s the problem?

A Procedure for Classification of Cup-Anemometers

Steady State Comparisons HAWC2 v12.2 vs HAWCStab2 v2.12

SF2822 Applied Nonlinear Optimization. Preparatory question. Lecture 9: Sequential quadratic programming. Anders Forsgren

Realtime 3D stress measurement in curing epoxy packaging

Matrix representation of a Neural Network

A comparison study of the two-bladed partial pitch turbine during normal operation and an extreme gust conditions

Algorithms for constrained local optimization

Determination of packaging induced 3D stress utilizing a piezocoefficient mapping device

Unknown input observer based scheme for detecting faults in a wind turbine converter Odgaard, Peter Fogh; Stoustrup, Jakob

Inexact Newton Methods and Nonlinear Constrained Optimization

Constrained optimization: direct methods (cont.)

Optimization 2. CS5240 Theoretical Foundations in Multimedia. Leow Wee Kheng

Diffusion of sulfuric acid in protective organic coatings

Trust Regions. Charles J. Geyer. March 27, 2013

arxiv: v1 [math.na] 8 Jun 2018


Global Wind Atlas validation and uncertainty

minimize x subject to (x 2)(x 4) u,

Optimisation in Higher Dimensions

Dynamic Asset Allocation - Identifying Regime Shifts in Financial Time Series to Build Robust Portfolios

Multiple Scenario Inversion of Reflection Seismic Prestack Data

Plane Wave Medical Ultrasound Imaging Using Adaptive Beamforming

Optimization Problems with Constraints - introduction to theory, numerical Methods and applications

POWER SYSTEMS in general are currently operating

On the Einstein-Stern model of rotational heat capacities

Algorithms for nonlinear programming problems II

Improving L-BFGS Initialization for Trust-Region Methods in Deep Learning

Comparisons of winds from satellite SAR, WRF and SCADA to characterize coastal gradients and wind farm wake effects at Anholt wind farm

Constrained Nonlinear Optimization Algorithms

Methods for Unconstrained Optimization Numerical Optimization Lectures 1-2

Constrained Optimization

Recent Adaptive Methods for Nonlinear Optimization

Crack Tip Flipping: A New Phenomenon yet to be Resolved in Ductile Plate Tearing

Nonlinear Programming

An Introduction to Algebraic Multigrid (AMG) Algorithms Derrick Cerwinsky and Craig C. Douglas 1/84

Scientific Computing: An Introductory Survey

Scientific Computing: An Introductory Survey

Simulating wind energy resources with mesoscale models: Intercomparison of stateof-the-art

Geometry of power flows and convex-relaxed power flows in distribution networks with high penetration of renewables

Trust-Region SQP Methods with Inexact Linear System Solves for Large-Scale Optimization

Generalization to inequality constrained problem. Maximize

Graphs with given diameter maximizing the spectral radius van Dam, Edwin

Newton s Method. Ryan Tibshirani Convex Optimization /36-725

Control Configuration Selection for Multivariable Descriptor Systems Shaker, Hamid Reza; Stoustrup, Jakob

ISM206 Lecture Optimization of Nonlinear Objective with Linear Constraints

Sound absorption properties of a perforated plate and membrane ceiling element Nilsson, Anders C.; Rasmussen, Birgit

Validity of PEC Approximation for On-Body Propagation

Battling Bluetongue and Schmallenberg virus Local scale behavior of transmitting vectors

Optimal acid digestion for multi-element analysis of different waste matrices

Feasibility of non-linear simulation for Field II using an angular spectrum approach

Derivative-Based Numerical Method for Penalty-Barrier Nonlinear Programming

Optimization Methods

On the Local Quadratic Convergence of the Primal-Dual Augmented Lagrangian Method

A numerical study of vortex-induced vibrations (viv) in an elastic cantilever

Upper and Lower Bounds of Frequency Interval Gramians for a Class of Perturbed Linear Systems Shaker, Hamid Reza

Wake decay constant for the infinite wind turbine array Application of asymptotic speed deficit concept to existing engineering wake model

NUMERICAL OPTIMIZATION. J. Ch. Gilbert

Numerical Optimization

Transcription:

Downloaded from orbit.dtu.dk on: Oct 19, 2018 A Trust-region-based Sequential Quadratic Programming Algorithm Henriksen, Lars Christian; Poulsen, Niels Kjølstad Publication date: 2010 Document Version Publisher's PDF, also known as Version of record Link back to DTU Orbit Citation (APA): Henriksen, L. C., & Poulsen, N. K. (2010). A Trust-region-based Sequential Quadratic Programming Algorithm. Kgs. Lyngby: Technical University of Denmark, DTU Informatics, Building 321. IMM-Technical Report-2010-14 General rights Copyright and moral rights for the publications made accessible in the public portal are retained by the authors and/or other copyright owners and it is a condition of accessing publications that users recognise and abide by the legal requirements associated with these rights. Users may download and print one copy of any publication from the public portal for the purpose of private study or research. You may not further distribute the material or use it for any profit-making activity or commercial gain You may freely distribute the URL identifying the publication in the public portal If you believe that this document breaches copyright please contact us providing details, and we will remove access to the work immediately and investigate your claim.

IMM - TECHNICAL REPORT - 2010-14 A Trust-region-based Sequential Quadratic Programming Algorithm L. C. Henriksen 1 and N. K. Poulsen 2 1 Wind Energy Division, Risø National Laboratory for Sustainable Energy, Technical University of Denmark, DK-4000 Roskilde, Denmark, larh@risoe.dtu.dk 2 Dept. of Informatics and Mathematical Modelling, Technical University of Denmark, DK-2800 Kgs. Lyngby, Denmark, nkp@imm.dtu.dk ABSTRACT This technical note documents the trust-region-based sequential quadratic programming algorithm used in other works by the authors. The algorithm seeks to minimize a convex nonlinear cost function subject to linear inequalty constraints and nonlinear equality constraints. KEYWORDS nonlinear programming problem, sequential quadratic programming, trust region 1 Introduction A nonlinear cost function subject to nonlinear equality and inequalty constraints constitutes a nonlinear programming problem (NLP). Multiple methods for solving such problems exists, best known are sequential quadratic programming problem (SQP) [1, 2] algorithms and interior point (IP) algorithms [3, 4] or hybrids of the two [5]. The SQP is typically augmented with a trust region inequality constraint, which can be tightened of loosened depending on the progress made by the current search direction. More advanced versions of these algorithms are able to cope with nonlinear inequality constraints, a feature not yet tested with the algorithm presented in this paper. This paper describes a trust-region-based sequential quadratic programming (TRSQP) algorithm with a general framework which can be used for different problems with adaptations suited for the particular structure of the problem. Such a particular structure is seen in Nonlinear Model Predictive Control [6, 7, 8]. 2 Trust-region-based sequential quadratic programming A NLP of the general form has the Lagrangian min x f(x), s.t. c(x) = 0 and d(x) = 0 (1) L(x, ν, λ) = f(x) + ν T c(x) + λ T d(x) (2) leading to the Karush-Kuhn-Tucker (KKT) conditions for optimality [3, 4] L(x, ν, λ) = f(x) + c(x)ν + d(x)λ = 0 (3) c(x) = 0 (4) d(x) 0 (5) diag(d(x))diag(λ) = 0 (6) λ 0 (7) the KKT conditions will not be investigated further here as they more relevant for the IP methods. The NLP can be solved via an iterative procedure known as sequential quadratic programming (SQP) where 1

in each iteration the local approximated problem is solved as a quadratic programming problem (QP). min x m( x), m( x) = 1 2 xt H x + f(x) T x (8a) subject to c(x) + c(x) T x = 0 d(x) + d(x) T x 0 D x p δ (8b) (8c) (8d) The optimization variable is updated by the iterative progress (x, ν, λ) + = (x, ν, λ) + (x, ν, λ) (9) if certain step acceptance criteria are met. The extra inequality constraint (8d), known as a trust-region, where δ is the trust region radius, can be added to aid the convergence. Iterations continue either untill the maximum number of allowed iterations are met, if termination criteria are met or until the algorithm fails due e.g. bad numerical handling of the problem solving or because the problem was ill-posed to begin with. The outline of the TRSQP can be seen in Alg. 1. Algorithm 1: Trust region based sequential quadratic programming solver Set initial values for (x, ν, λ), δ = δ max and H = 2 f; for ITER from 1 to IMAX do Calc. trust-region scaling matrix D; Solve QP to obtain (x, ν, λ); Set trial variables (x, ν, λ) + = (x, ν, λ) + (x, ν, λ); Calc. f(x + ), c(x + ) and d(x + ) and their Jacobians; Calc. progress measures ρ and γ; Update trust region radius δ; Update Quasi-Newton approximation of Hessian of Lagrangian H; if Step accepted then Set (x, ν, λ) = (x, ν, λ) + ; if Convergence then Terminate algorithm; The different points of the algorithm are elaborated in the following sections. 2.1 Step acceptance Step acceptance is depending on two measures of progress as well as inequality feasibility. The actual to predicted cost reduction ratio ρ = f(x) f(x+ ) m( x) (10) provides a measure of how well the QP subproblem resembles the properties of the NLP at the current point. For ρ 1, the QP and the NLP are in good agreement. For ρ > 1, a greater decrease in cost function than predicted by the QP has occured and for 0 < ρ < 1 the actual decrease in cost function was not as good as predicted by the QP. For ρ < 0, the NLP and the QP are not in agreement of whether the cost function was decreased or increased with the current step. The relative improvement of the cost function γ = f(x) f(x+ ) f(x) (11) 2

(Solid) D x = 1 (Dashed) D x 2 = 1 2 1 x opt x2 0-1 -2-2 -1 0 1 2 x 1 Figure 1: Two trust regions are shown in this figure: The box is given by the -norm and the ellipsiod is given by the 2-norm. Both norms have scaling matrices based on the Hessian. provides another measure of progress. Different strategies can be taken. In the present work the cost function is allowed to increasee as long as ρ or γ are positive and inequalties are feasible. This enables a search where the algorithm is able to move around. The step is accepted if the QP was able to find a feasible solution and if max(ρ, γ) > 0 (12) max(d(x + )) τ d(x) (13) furthermore steps are not accepted in the first iteration as experience has shown better performance by letting the algorithm start up and generate its first Quasi-Newton Hessian approximiation etc. 2.2 Trust region A trust region with the general form D x p δ (14) where D is the scaling matrix, p is the number of the norm, e.g. 1,2 or, and δ is the trust region radius, can be imposed on the QP. The first choice of a suitable scaling matrix D might be the identity matrix. An even better choice takes the Hessian H into account as the problem might not be equally sensitive to changes of x in all directions. The 2-norm yields a quadratic constraint x T D T D x δ 2 which can be chosen to be indentical to the Hessian x T H x δ 2. The quadratic constraints would make the approximated problem a quadratic constrained quadratic cost (QCQC) problem which does not fit into the normal QP framework used by the SQP, where only linear constraints occur. The -norm results in linear inequalty constraints δ D x δ (15) and is thus suitable for the description of a trust region which can be included in standard QP. Inspired by the quadratic constraints the Hessian can be decomposed by e.g. singular value decomposition H = UΣV T = UΣ 1/2 Σ 1/2 V T = D T D (16) giving a multidimensional box circumscribing the ellipsoid of the quadratic constraint as tight as possible. Fig. 1 shows how the -norm and the 2-norm trust regions resemble the Hessian and ensure that steps are constrained along the dimensions in a fashion scaled by the Hessian. 3

2.3 Trust region radius The trust region radius should be increased or decreased according to a set of rules: If ρ is small indicating poor agreement between the NLP and QP or if γ is negative indicating an increase in cost function or if the previous step has been rejected for some reason, then the trust region radius should be decreased. If the QP failed, it is most likely due to a too restrictive trust region radius and the radius should be increased. If ρ was high, indicating good agreement between the NLP and the QP, then the trust region radius should be increased. δ/2 for ρ < 1/4 or γ < 0 or Step Rejected δ = min(3δ, δ max ) for QP failed min(3 min(δ, D x p ), δ max ) for ρ > 3/4 (17) 2.4 Quasi-Newton approximations of Hessian of Lagrangian The quadratic cost in the QP should idealy by the Hessian of the Lagrangian of the NLP n c n d 2 L = 2 f(x) + 2 c i (x)ν i + 2 d i (x)λ i (18) i this would computational expensive and if no analytic second derivatives of the cost function and constraints are available, those would have to be determimed via finite differences. Commonly used alternatives are different Quasi-Newton approximations such as dbfgs (Alg. 2), SR1 (Alg. 3) and SR1pos (Alg. 4). They are all calculations based on the variable step s and and with the difference in the gradients of the Lagrangian y i s = x + x (19) y = L(x +, ν +, λ + ) L(x, ν +, λ + ) (20) Algorithm 2: damped Broyden-Fletcher-Goldfarb-Shanno (dbfgs) update if s T y < 0.2s T Hs then θ = 0.8s T Hs(s T Hs s T y) 1 ; y = θy + (1 θ)hs; H = H Hss T H(s T Hs) 1 + yy T (y T s) 1 ; Algorithm 3: Symmetric rank-1 (SR1) update if (y Hs) < 10 6 (y Hs)s then H = H + (y Hs)(y Hs) T ((y Hs) T s) 1 ; Algorithm 4: Positive definite symmetric rank-1 (SR1pos) update if (y Hs) < 10 6 (y Hs)s then H = H + (y Hs)(y Hs) T ((y Hs) T s) 1 ; ΛV = HV; H = VΛ + V 1 ; The dbfgs and SR1pos both maintain a positive definite Hessian, making the QP easier to solve. The SR1 might be a better fit if the NLP is not positive definite. 4

2.5 Termination The algorithm terminates if steps toward the optimum are becomming to small and if the last accepted step was feasible x 2 τ x (21) max(d(x + )) τ d(x) (22) c(x + ) τ c(x) (23) 3 Discussion and future work The presented algorithm has been tested with Nonlinear Model Predictive Control, with nonlinear equality constraints [6]. It has not been tested with nonlinear inequality constraints and performance with nonlinear inequality constraints remains to be investigated. References [1] Richard H. Byrd, Jean Charles Gilbert, and Jorge Nocedal. A trust region method based on interior point techniques for nonlinear programming. Mathematical Programming, 89(1):, 2000. [2] Stephen J. Wright and Matthew J. Tenny. A feasible trust-region sequential quadratic programming algorithm. SIAM journal on optimization, 14(4):1074 1105, 2004. [3] Jorge Nocedal and Stephen J. Wright. Numerical Optimization. Springer, 2nd edition, 2006. [4] Stephen Boyd and Lieven Vandenberghe. Convex Optimization. Cambridge University Press, 2004. [5] R.A. Waltz, J. Nocedal, D. Orban, and J.L. Morales. An interior algorithm for nonlinear optimization that combines line search and trust region steps. Math. Program., 107(3):391 408, 2006. [6] L. C. Henriksen, N. K. Poulsen, and M. H. Hansen. Nonlinear model predictive control of a simplified wind turbine. Submitted for Proc. World Congr. Int. Fed. Autom. Control., IFAC, pages, 2011. [7] L. C. Henriksen and N. K. Poulsen. An online re-linearization scheme suited for model predictive or linear quadratic control. IMM-Technical Report 2010-13, Dept. of Informatics and Mathematical Modelling, Technical University of Denmark, 2010. [8] Matthew J. Tenny, Stephen J. Wright, and James B. Rawlings. Nonlinear model predictive control via feasibility-perturbed sequential quadratic programming. Computational Optimization and Applications, 28(1):87 121, 2004. 5