Information Recovery in Composite Model Reference Adaptive Control

Metehan Yayla; Ali Turker Kutay

doi:10.5772/intechopen.1005440

Abstract

This study introduces a new adaptive control approach incorporating frequency-limited estimation of matched uncertainty. While many existing adaptive parameter adjustment laws aim to mitigate uncertainty effects solely through tracking error, it is well-documented that integrating uncertainty estimation error into the adaptation process significantly improves transient performance. Our method incorporates low-frequency uncertainty estimation with a time-varying learning rate structure. Unlike conventional filter-based approaches, our approach also compensates for information loss during signal filtering to suppress high-frequency content. Additionally, we include a regulation term in the standard adaptive weight update law, acting as stability enhancement in the adaptive system. We demonstrate the closed-loop stability of the proposed method using Lyapunov’s stability theorem and highlight its efficacy through numerical examples and software-in-the-loop simulations with the X-plane flight simulator.

Keywords

filter
uncertainty
transients
fast adaptation
robustness

Author Information

Show +

Metehan Yayla*
- Esen System Integration LLC, Ankara, Turkey
Ali Turker Kutay
- Department of Aerospace Engineering, Middle East Technical University, Ankara, Turkey

*Address all correspondence to: myayla@metu.edu.tr

1. Introduction

Model Reference Adaptive Control (MRAC) frameworks achieve desired levels of closed-loop stability and performance for uncertain dynamical systems through online adaptive weight update laws. In the absence of restrictive persistent excitation (PE) of system signals, the standard MRAC framework cannot assure closed-loop stability when facing bounded perturbations using only instantaneous data [1]. To enhance the robustness of standard MRAC or ensure stability without PE, robust modifications are introduced in literature [2, 3, 4, 5, 6, 7]. While these modifications guarantee the robustness of the adaptive controller, they generally yield limited enhancements in the transient response, primarily due to their focus on incorporating only the tracking error in the adaptation. Nonetheless, there exist a few additional modifications, such as those detailed in Refs. [8, 9], which notably improve transient performance by extracting additional information on uncertainty estimation error. Inspired by these studies, various modifications have been proposed in the literature, including but not limited to [10, 11, 12, 13, 14, 15, 16]. In these studies, information regarding either uncertainty estimation error or parameter convergence error is extracted using low-pass filters. However, filtering system signals may lead to information loss and degradation of adaptation performance.

To improve the tracking performance and achieve fast adaptation, it is desired to set the adaptive gains as high as possible. However, it is also well-practiced that the high learning rates can excite (possibly unmodeled) high-frequency modes of the system and result in system instability due to high-frequency oscillations. Numerous studies in the literature aim to facilitate the utilization of high gains while ensuring stability, such as the optimal control modification [5] and the low-frequency learning modification [17]. Yucelen and Haddad [17] introduced a modification term in the standard adaptive law to constrain the estimated parameters within bounds around their low-pass filtered signals. In the optimal control modification [5], high adaptive gains are permitted, albeit at the cost of reduced performance concerning robustness. Unlike these modifications, where the learning rate remains constant, there are studies where the adaptation gain varies over time [16, 18]. However, the learning rate vanishes in these methods if the basis function is not sufficiently and/or persistently exciting, causing the termination of adaptation.

So, our motivation becomes to combine these approaches in a way that the modification term allows the use of high adaptation gains without causing any high-frequency oscillations, and the learning rate of the proposed modification is adjusted to prevent undesired behavior during the adaptation.

In this study, we propose a new adaptive law that offers a modification term for information recovery in filter-based adaptive controllers. The proposed modification achieves fast adaptation by adding a new direction in adaptation using high-frequency content of the filtered signals. In addition, a time-varying learning rate is introduced to suppress the undesired effects of high-frequency signals on the system behavior. In this way, the proposed modification term behaves like a stability augmentation system to standard adaptive law. It should be noted that the proposed study builds upon the foundation established in the author’s earlier work [19] by presenting more rigorous theoretical results and introducing additional simulations that highlight the effectiveness of the proposed method. Furthermore, the contributions are extended to cover a wider class of uncertain systems where limited information exists on uncertainty. These improvements result in a clearer and more precise presentation of the research, while expanding the domain of applications with new theoretical results through unstructured uncertainty parameterization.

2. Notation and definitions

The notation used in this study is fairly standard. Specifically, R denotes the set of real numbers, Rn denotes the set of n×1 real column vectors, Rn×m denotes the set of n×m real matrices, R+ denotes the set of strictly positive real numbers, ⋅−1 denotes inverse, ⋅T denotes transpose, ‘≜’ denotes equality by definition, A≻0 denotes that A is a positive definite matrix, ⋅† denotes the Moore–Penrose inverse, ‘vec⋅’ denotes the column stacking operator, ‘tr⋅’ denotes the trace operator, λminAλmaxA returns the minimum eigenvalue (the maximum eigenvalue) of matrix A, σmaxAσminA returns the maximum (minimum) singular value of matrix A, and ∣a∣ denotes the absolute value of scalar a∈R. Furthermore, for the vector x∈Rn and matrix A∈Rm×n, the Euclidean vector norm x, induced matrix norm A, and Frobenius matrix norm AF are defined as:

x≜∑i=1nxi2,A≜λmaxATA=σmaxA,AF≜trATA.E1

Kronecker product is denoted by ‘⊗’. For the vector θ∈Rk and convex function f:Rk→R, the gradient operator ∇fθ is ∇fθ=∂fθ∂θ1⋯∂fθ∂θkT. Lastly, ‘■’ indicates the completion of a mathematical proof.

Definition 1.1 (Persistent Excitation (PE)) The signalωt∈Rnis said to be persistently exciting over an excitation period ofτif and only if there exist positive constantsτ,β∈R+such that the following inequality holds:

∫t−τtωsωTsds≽βIn×n,∀t∈R+E2

for all time∀t∈R+whereIn×nis the identity matrix of dimension n.

Definition 1.2 (Interval Excitation (IE)) The signalωt∈Rnis said to be interval exciting at timet=Teover an excitation period ofτif there exist positive constantsτ,β∈R+such that the following inequality holds:

∫t−τtωsωTsds≽βIn×n,t=Te∈R+E3

Remark 1.1. One can realize that the interval excitation (IE) is far less demanding condition than the persistent excitation (PE) as IE condition must be satisfied for all time∀t∈R+to satisfy the PE condition. That is, persistency of excitation is uniform in time, whereas the interval excitation just holds for a particular time interval.

3. Problem definition

In this section, the control system description is provided first. Then, standard MRAC and composite/combined MRAC solutions are briefly reviewed for the completeness of the chapter.

3.1 System description

Consider the following nonlinear uncertain dynamical system:

ẋt=Axt+But+Δx,xt0=x0E4

where x∈Dx⊂Rn is the state vector with Dx being a sufficiently large compact set, u∈Du⊂Rm is the m-dimensional control input that lies in the admissible control set Du, Δ:Rn→Rm is the map representing the unknown uncertainty, A∈Rn×n is known system matrix. Furthermore, input matrix B∈Rn×m has full column rank and is assumed to be known (consistent with the literature; for example [6, 7, 14, 20, 21]), which is necessary to compute low-frequency content of the uncertainty using its pseudo-inverse (detailed later in Section 3.3). The fundamental assumptions are: (i) the pair (A, B) is controllable, (ii) full state measurement is available for feedback.

Assumption 1.1 (Structured Uncertainty). The uncertaintyΔxcan be expressed as linear combination of known basis functionsϕ:Dx→Rsas follows:

Δx=WTϕxE5

whereW∈Rs×mis the unknown constant weight matrix with∥W∥≤w¯, w¯>0, and known functionϕxis Lipschitz continuous overDx.

Assumption 1.2 (Unstructured Uncertainty). Assumption 1.1 can be relaxed using universal approximators such as Single Hidden Layer Neural Networks (SHL-NN) [22] or Radial Basis Functions (RBF) [23] as follows:

Δx=WTϕx+εxE6

where the residualεxwith∥εx∥≤ε0, ∀x∈Dx.

Remark 1.2. The basis functionϕxin Assumption 1.2 can be chosen such that the residualεxcan be made arbitrarily small on a compact setDxwith a positive scalar upper boundε0. Various methods exist for selecting such appropriate basis functions as universal approximators. One such method is to employ Radial Basis Functions (RBF) as suggested by the universal approximation theorem [23,24] while minimizing the approximation errorεx. Due to its simplicity and extensive use in the literature (for example [25–27]), we parameterize the unstructured uncertainty inEq. (6)using RBFs given by:

ϕix=1,i=1exp−xt−c¯i22μ¯i2,i=2,⋯,sE7

with c¯i∈Rn and μ¯i∈R+ being the center and width of the i^th node, respectively [23]. It should be noted that in the structured uncertainty case, the basis ϕ could be any known function and does not necessarily consist of Gaussian kernels.

The primary goal in model reference adaptive control (MRAC) is to design a control input to ensure that the system trajectories track those of the reference model given by:

ẋrt=Arxrt+Brrt,xrt0=xr0E8

with Ar∈Rn×n being Hurwitz reference system matrix and Br∈Rn×k being reference input matrix. Positive-definite matrix P∈Rn×n satisfies the following Lyapunov equation

ArTP+PAr=−Q,P=PT≻0E9

for any user-defined positive definite matrix Q=QT≻0. The reference state vector and reference command are denoted by xr∈Dx and r∈Rk, respectively.

Assumption 1.3 (Matching Conditions). As a standard assumption in adaptive control [28–30], the reference model satisfies the matching conditions. That is, there exist matricesKxandKrsuch that the following conditions hold:

Ar=A−BKxBr=BKrE10

The nominal control input unt, consisting of feedback and feedforward components, is designed by setting Δx=0 in the plant dynamics in Eq. (4) as follows:

unt=−Kxxt+KrrtE11

The adaptive controller augmented control input is given by:

ut=unt+uat=−Kxxt+Krrt−ŴTtϕxE12

with Ŵt being the online estimation of unknown weight W.

3.2 Standard MRAC review

Let the tracking error be defined as et≜xrt−xt. Combining the control in Eq. (12), matched uncertainty in Eq. (5), reference model in Eq. (8), and uncertain system dynamics in Eq. (4) with structured uncertainty parameterization yields the following tracking error dynamics:

ėt=Aret−BW˜TtϕxE13

with W˜≜W−Ŵ being the weight estimation error. In the standard adaptive control formulation, the baseline weight update law is given by:

Standard MRACLaw:Ŵ̇=−ΓϕeTPBE14

with Γ≻0 being positive definite learning rate. The closed-loop stability of the standard adaptive control system can be shown using radially unbounded Lyapunov function VeW˜=eTPe+trW˜TΓ−1W˜>0 with V̇eW˜=−eTQe≤0. Note that V00=0 and VeW˜>0, for all eW˜≠00, ∀t∈R+. Since VeW˜ is lower-bounded by zero, and its derivative V̇eW˜≤0 is less than or equal to zero, Lyapunov function VeW˜ approaches to a finite limit as t→∞. Hence, the boundedness of tracking error e(t) and weight estimation error W˜t is guaranteed. Since e(t) and xrt are bounded, system states x(t) and basis vector-function ϕx immediately become bounded. With bounded et,xt,ϕx, and W˜t, time derivative of tracking error dynamics ėt becomes bounded, which ensures the boundedness of V¨eW˜=−2eTQė, ∀t∈R+. It then follows from Barbalat’s Lemma [31] that limt→∞V̇etW˜t=0 which implies the asymptotic stability of tracking error e(t); that is, limt→∞et=0. If the basis vector-function ϕx is persistently exciting, then the estimated parameters converge to their ideal values; that is, Ŵt→W as t→∞. See Ref. [29] for details.

Remark 1.3. If the uncertaintyΔxis characterized as unstructured uncertainty as in Assumption 1.2, the tracking error dynamics become:

ėt=Aret−BW˜Ttϕx−BεxE15

In this case, the boundedness of the estimated parameters is not guaranteed by the standard MRAC without persistently excitingϕx. Several robust modifications have been introduced in the literature to enhance the robustness, including but not limited toσ-mod [2], e-mod [3], optimal control-based modification [5], Kalman filter-based modification [7], and q-modification [9]. Furthermore, it is common practice to include the projection operator [32] to bound the parameters within the prescribed convex set.

Remark 1.4. Consider the adaptive law withσ-modification as a solution to the stability issue outlined in Remark 1.3:

Ŵ̇=−ΓϕeTPB+σŴE16

where σ∈R+ being constant scalar design parameter. It can be shown using Lyapunov function V=12eTPe+12trW˜TΓ−1W˜ that the closed-loop system is stable in the sense of Lyapunov, even if εx≠0 without requiring persistent excitation of the basis function ϕ. Specifically, the error signals e and W˜ are ensured to be inside the compact set Dη which is defined as Dη≜ηt:∥η∥<μ2μ1 with μ1,μ2,η being as the followings:

μ1≜min12λminQ−c1σ1−c2μ2≜∥PB∥2ε024c1+σw¯4c2c1∈012λminQ,c2∈01η≜eTvecW˜TTE17

3.3 Composite/combined MRAC review

Although high gains enable fast adaptation in standard MRAC, it may degrade the transient performance by inducing high-frequency oscillations in the system and even cause instabilities in the presence of large uncertainties. Improving the transient performance in MRAC has always been an attractive problem [5, 11, 17, 20, 21, 33, 34]. As seen from these studies and references therein, including the uncertainty estimation error in the adaptation enhances transient performance significantly. This fact was first utilized in [8], called ‘Composite Model Reference Adaptive Control’. Nearly at the same time, a quite similar approach was introduced [35] as ‘Combined Model Reference Adaptive Control’. Later in [20], these two approaches are generalized to cover the multi-input multi-output systems. These methods are referred in this study as composite/combined model reference adaptive control (CMRAC) to acknowledge both studies. Basically, CMRAC starts with filtering the system dynamics in Eq. (4) as:

ẋf=Axf+Buf+ΔfE18

with subscript ‘f’ indicating that the signal is filtered through the following low-pass filters:

ẋf=ωfx−xf,xft0=x0u̇f=ωfu−uf,uft0=u0ϕ̇f=ωfϕ−ϕf,ϕft0=ϕx0E19

where ωf∈R+ is the cut-off frequency that determines the bandwidth of low-pass filters. Rearranging Eq. (18) yields:

Δf=B†ẋf−Axf−ufE20

Note that the signals xf, uf, ϕf, ẋf and ϕ̇f are all accessible through Eq. (19). It should also be noted that Δf can be expressed as Δf=WTϕf+εf from Eq. (5). Hence, we get

WTϕf+εf=B†ẋf−Axf−ufE21

Once the low-pass filters are applied to the system dynamics, the low-frequency content of uncertainty, Δf, becomes an available signal for control purposes, as all the signals on the right-hand side of Eq. (20) are available. Frequency-limited estimation of the uncertainty is defined as:

Δ̂f=ŴTϕfE22

Next, one can define the low-frequency uncertainty estimation error as:

Δ˜f≜Δf−Δ̂fE23

=B†ẋf−Axf−uf−ŴTϕfE24

Then, the modified update law in CMRAC is given by

CMRACLaw:Ŵ̇=Γ−ϕeTPB+γcϕfΔ˜fTE25

where the error signal Δ˜f is available for feedback in control through Eq. (24) and γc∈R+ is a positive scalar learning rate for CMRAC. It should be noted that CMRAC modification term is gradient descent-based minimizing solution of the following optimization problem:

minŴJJŴ=12∥Δf−Δ̂f∥2=12∥Δ˜f∥2E26

where the gradient of the cost function in Eq. (26) is given by:

∇JŴŴ=−ϕfΔfx−ŴTtϕfxT=−ϕfΔ˜fTE27

Hence, the CMRAC weight update law can also be written as follows:

CMRACLaw:Ŵ̇=Γ−ϕeTPB−γc∇JŴŴE28

In this section, we revisit the CMRAC formulation assuming structured uncertainty definition as this is the case in contributing reference studies [8, 20, 36], which implies εx=εfxt=0. Then, the asymptotic stability of the closed-loop system with CMRAC weight update law in Eq. (25) can be shown with radially unbounded Lyapunov function VeW˜=12eTPe+12trW˜TΓ−1W˜. Its time-derivative along the system trajectories in Eq. (13) and Eq. (25) is given by V̇eW˜=−12eTQe−γctrW˜TϕfϕfTW˜ which is also equal to V̇eW˜=−12eTQe−γc∥Δ˜f∥2. With this result, boundedness of the Lyapunov function V is guaranteed, which implies the boundedness of the state tracking error e(t) and weight estimation error W˜t. Next, following the similar discussions made in MRAC, one can show the boundedness of V¨eW˜=−12eTQė−γcΔ˜fT−Ŵ̇Tϕf+W˜Tϕ̇f using Eqs. (19) and (25). It then follows from Barbalat’s Lemma [31] that limt→∞V̇etW˜t=0 which implies the asymptotic stability of the state tracking error e(t) and filtered uncertainty estimation error Δ˜ft; that is, limt→∞et=0 and limt→∞Δ˜ft=0. It should be emphasized that the parameter convergence is still not guaranteed with CMRAC since the asymptotic stability of Δ˜f does not imply the asymptotic stability of weight estimation error W˜. Furthermore, in case of unstructured uncertainty with nonzero residual εx≠0, the boundedness of adaptive weights is not guaranteed without persistent excitation of ϕx as in standard MRAC formulation, and the comments in Remark 1.3 apply to the CMRAC, as well.

Remark 1.5. Cut-off frequency,ωf, inEq. (19)should be chosen carefully to reject the measurement noise and high-frequency content in the uncertainty. Although too smallωfsuccessfully filters the high-frequency signals in the adaptation, it may degrade the adaptation performance sinceŴresponse becomes significantly sluggish [36].

4. Main result

Although CMRAC and its further modifications (for example [10, 12, 13, 14, 21, 37]) contributed a lot to the robustness and performance of the standard MRAC, all these filtering-based solutions suffer from losing information during filtering. As emphasized in Remark 1.5, the performance degradation due to lost information may reach significant levels due to a poor selection of filter bandwidth. If the system experiences high-frequency variations (for example, payload drop for an aircraft, actuator failures), sluggish evolution of the adaptive weights may not guarantee the desired tracking performance. Furthermore, a relatively small cut-off frequency ωf may delay the adaptation due to time-delay nature of low-pass filters. In this study, we address these issues and propose a new model reference adaptive control architecture that compensates for the information lost during filtering by including the high-frequency content of the filtered signals. In addition, a time-varying learning rate is also proposed to suppress the undesired effects of high-frequency signals in the adaptation.

Assumption 1.4 (Interval Excitation). Basisϕ∈Rsis an interval exciting signal. Hence, there exist constant scalarsτ,α,Te∈R+such that the following inequality holds:

∫t−τtϕsϕTsds≽αIs×s,t=TeE29

Remark 1.6. Assumption 1.4 ensures that the basis functionϕcontains as many spectral lines as the number of unknown parameters within time intervalt∈Te−τwTe. Note that the basis functionϕis an exogenous signal to the low-pass filter system inEq. (19). Then, the filtered basis functionϕfalso has the same number of spectral lines with less energy than that of the original basisϕ. This implies the following:

∫t−τtϕfsϕfTsds≽βIn×n,t=TeE30

with 0<β≤α. Let the transfer function representing the linear filter dynamics be Gs≜ωfs+ωf. Then, the degradation in the richness of the signal (which is equivalent to how small is β than α) depends on the cut-off frequency ωf and spectrum of the basis ϕ. Let the spectral measure of ϕ is given by Sϕω1ω2 with ω1<ω2. Choosing the filter design parameter ωf to be larger than ω2 leads to relatively small degradation in the richness of ϕf with β being closer to α.

4.1 Integrated uncertainty estimation error

We further manipulate the low-frequency content of the matched uncertainty Δf by defining the following integrals over a moving window with a length of τω seconds:

Mω≜∫t−τωtϕfΔfTdξ=∫t−τωtϕfϕfTdξW+∫t−τωtϕfεfTdξM̂ω≜∫t−τwtϕfϕfTdξΘ̂=ΦωΘ̂E31

where Θ̂∈Rs×m is an auxiliary weight, which will be discussed in Section 4.2.

Although ideal weight matrix W and filtered residual εf are unknown, the low-frequency content of the uncertainty, Δf, is accessible from Eq. (22). Hence, the integrated signal Mω can be computed during online operations. Next, the corresponding error M˜ω is defined as the following:

M˜ω≜Mω−M̂ω=∫t−τωtϕfϕfTdξΘ˜+∫t−τωtεfϕfTdξ=ΦwΘ˜+∫t−τωtεfϕfTdξE32

with Θ˜≜W−Θ̂ being an auxiliary weight estimation error. It should also be noted that the signal M˜ω is also available since both Mω and M̂ω are accessible online. In order to preserve the information in excited basis functions, temporarily frozen variables M and M̂ are defined as follows:

M˜tΦt≜M˜ωtΦωt,ifκΦωt<κΦt−M˜t−Φt−,otherwiseE33

where M˜t− and ϕt− have zero initial conditions of appropriate dimensions. Additionally, κ⋅ returns the condition number of its argument. With the definition in Eq. (33), it is ensured that the condition number of Φt is monotonically non-increasing.

4.2 Auxiliary weight update law

In this section, we construct the first layer of dual adaptive law using the integrated signals in Section 4.1. The main purpose of the auxiliary adaptive law is to increase the stiffness of primary update law (which will be detailed in Section 4.3) and ensure the boundedness of adaptive weights Ŵ when Δx is parameterized as unstructured uncertainty; i.e., εx≠0. Additionally, the auxiliary adaptation guarantees the parameter convergence without PE if the uncertainty is structured and Assumption 1.4 on interval excitation holds; i.e., W˜→0 as t→∞ when εx=0. In order to construct the auxiliary weight update law, we begin with the following optimization problem:

minŴC,CŴ=12∥M−M̂∥2E34

where the corresponding gradient becomes:

∇CŴŴ=−ΦM−M̂=−ΦΦΘ˜+ϵE35

with ϵ=M−ΦW. Then, the adaptive law for the auxiliary weights θ̂ is constructed using gradient descent-based optimization as:

Θ̂̇=ΓΘΦM˜,Θ̂t0=Θ̂0E36

with ΓΘ≻0 being positive definite gain matrix of appropriate dimensions.

Theorem 1.1. Consider the uncertain dynamical system inEq. (4)with unstructured uncertainty characterization inEq. (6), auxiliary weight update law inEq. (36), and integrated frozen signals inEq. (33). Then, auxiliary weight estimation errorΘ˜is uniformly ultimately bounded. Furthermore, if Assumption 1.4 on interval excitation of the basis functionϕis satisfied, the estimated auxiliary weights converge to a closer neighborhood of ideal weight W.

Proof: Consider the following Lyapunov function:

VΘ=12trΘ˜TΓΘ−1Θ˜E37

Its time-derivative along the trajectory in Eq. (36) is obtained as:

V̇Θ=−trΘ˜TΦM˜=−trΘ˜TΦΦΘ˜−trΘ˜TΦϵ=−∥ΦΘ˜∥F2−trΘ˜TΦϵE38

If Assumption 1.4 is not satisfied, Φt=0 holds due to Eq. (33) with zero initial condition Φt0=0, which results in V̇Θ=0. Thus, the adaptation for the auxiliary weights vanishes and Θ̂ stays at its initial condition, i.e. Θ̂̇=0, Θ̂t=Θ̂0. Hence, the boundedness of auxiliary parameters is guaranteed if Assumption 1.4 does not hold. From this point forward, let us investigate the case where the interval excitation assumption holds. Note that, at any time t, the integrated residual error ϵ for the unstructured uncertainty case can be bounded from above as:

∫t−τwtεfϕfTdξ≤∫t−τwtεfϕfTdξ≤∫t−τwtεfϕfTdξ≤∫t−τwtε0sdξ≤ε0τws,∀t∈R+E39

by utilizing the Gaussian kernels as radial basis functions for ϕ∈Rs. Then, the time-derivative of the Lyapunov function can be bounded as

V̇Θ≤−∥ΦΘ˜∥F2+c1∥Φθ˜∥F2+14c1∥ϵ∥2=−1−c1∥ΦΘ˜∥F2+c22E40

with c2≜ε0τws2c1. Then, its is clear that for any c1∈01, the matrix product ΦΘ˜ is bounded since V̇Θ<0 if ∥ΦΘ˜∥F>c21/1−c1. As Assumption 1.4 is satisfied, the matrix Φ is positive definite with λminΦ≥β from Remark 1.6. The boundedness of ΦΘ˜ with an invertible Φ guarantees the boundedness of auxiliary parameter estimation error Θ˜, since Θ˜=Φ−1ΦΘ˜, that is, ∥Θ˜∥≤d1 with d1>0 being unknown positive constant that depends on the excitation level β, residual bound ε0, and number of RBF neurons s. With constant unknown weight matrix W and Θ˜∈L∞, the boundedness of auxiliary weight estimation Θ̂ is ensured, as well. ■

Corollary 1.1. Consider the control system outlined in Theorem 1.1. With structured uncertainty parameterization inEq. (5), auxiliary weightsΘ̂converge exponentially to the ideal weights W if Assumption 1.4 holds.

Proof: Structured uncertainty parameterization implies εx=ϵ=0. Hence, Lyapunov derivative becomes:

V̇Θ=−∥ΦΘ˜∥F2≤−λminΦ2∥Θ˜∥2≤−β2∥Θ˜∥2<0E41

with β being excitation level in Remark 1.6. Quadratic positive definite Lyapunov function with quadratic negative definite derivative ensures the exponential convergence of the auxiliary weight convergence error Θ˜ at a rate of β2; that is, ∥Θ˜t∥≤∥Θ˜t0∥e−β2t−t0≜d2t≤d¯2 with d¯2∈R+ being an unknown constant scalar. Since the Lyapunov function is radially unbounded, this result holds globally.

Remark 1.7. Unlike the minimum singular value maximization algorithm [38], the condition number minimization is utilized inEq. (33). The main reason is that the deviation from the ideal solution W in least-squares regression due to perturbationεis directly proportional to the condition number of the regressor matrixΦ. Therefore, minimizing the condition number ofΦimproves the robustness against residual errorεx. Furthermore, maximizing the minimum singular value is inherently embedded in condition number minimization due to the definitionκ⋅=λmax⋅λmin⋅. It should be emphasized that the condition number minimization also introduces additional information ofλmaxΦon temporarily frozen matrices, which results in more frequent update, more robust adaptation, and faster development of weight estimations. Further discussion on condition number minimizing algorithm can be found in [39].

4.3 Primary layer of adaptation

In this section, the primary weight update law is introduced for the dual adaptive controller. We start by decomposing the weight update law as:

Ŵ̇=Ŵ̇b+Ŵ̇m,withŴ̇b=−ΓϕeTPBE42

where Ŵ̇b is the standard MRAC law in Eq. (14) and Ŵ̇m is the proposed modification term given as the following:

Ŵ̇m=Γ−1+γmϕfϕfT−1γmϕfϕfTΓϕeTPB+γmϕ̇fΔ˜fT+γ1ϕfΔ˜fT+γ2Θ̂−ŴE43

where γ1,γ2,γm∈R+ are user-defined design variables. Then, the proposed weight update law can be written as

Ŵ̇=−XϕeTPB+Xγ1ϕfΔ˜fT+γ2Θ̂−Ŵ+γmϕ̇fΔ˜fTE44

where the time-varying learning rate X is given by

X≜Γ−1+γmϕfϕfT−1E45

Lemma 1.1. The Learning rateXis a positive definite matrix.

Proof:

X≻0⇒Γ−1+γmϕfϕfT−1≻0⇒Γ−1+γmϕfϕfT≻0since′ϕfϕfT≽0'andγm>0⇒Γ−1≻0⇒Γ≻0E46

Since Γ≻0 by the design, the claim X≻0 holds, as well.

Remark 1.8. The termγ2Θ̂−Ŵis essentially aσ-modification term introduced to the adaptive law, causing the estimated weightŴto remain bounded around auxiliary weightΘ̂. Recall thatΘ̂is bounded from Theorem 1.1. Hence, the stability results from standard MRAC withσ-modification in Remark 1.4 can directly be applied to ensure the Lyapunov stability of all the closed-loop signals with the proposed adaptive controller, as will be shown by Theorem 1.2 and Theorem 1.3.

4.4 Stability results for unstructured uncertainty

Theorem 1.2. Consider the uncertain system dynamics given inEq. (4), uncertainty parametrization in Assumption 1.2, reference model inEq. (8), control input inEq. (12), filter states inEq. (19), and adaptive law inEq. (44). Then, all the signals in closed-loop system are uniformly ultimately bounded.

Proof: Consider the following radially unbounded Lyapunov function:

V=12eTPe+12trW˜TX−1W˜≤12λmaxP∥e∥2+12λmaxΓ−1+γms∥W˜∥2E47

Note that V00=0 and VeW˜≠0 for all eW˜≠00, ∀t∈R+. Time derivative of the Lyapunov function along the system trajectories in Eqs. (13) and (44) is given as follows:

V̇=eTPė+trW˜TX−1W˜̇+12trW˜TX−1W˜=−12eTQe−trW˜TX−1Ŵ̇+γmtrW˜Tϕ̇fϕfTW˜E48

Substituting the weight update law in Eq. (44) into Eq. (48) gives:

V̇=−12eTQe−γ1trW˜TϕfϕfTW˜−γ2trW˜TW˜−γ1trW˜TϕfεfT−γmtrW˜Tϕ̇fεfT−γ2trW˜TΘ̂−W=−12eTQe−γ1∥W˜Tϕf∥F2−γ2∥W˜∥F2−γ1trW˜TϕfεfT−γmtrW˜Tϕ̇fεfT−γ2trW˜TΘ̂−W≤−12λminQ∥e∥2−γ1∥W˜Tϕf∥2−γ2∥W˜∥2+∥W˜∥d4=−12λminQ∥e∥2−γ1∥W˜Tϕf∥2−γ2∥W˜∥∥W˜∥−d4E49

where d4≜γ1ε0s+γmε0d3+γ2d1, ∥Θ̂−W∥≤d1 is guaranteed from Theorem 1.1, ∥ϕ̇f∥≤d3 holds for ϕx consisting of smooth Gaussian kernels, with d3>0 being an unknown positive constant. It is obvious that the time derivative of Lyapunov function is negative definite for ∥W˜∥>d4. Thus, the error signals e and W˜ asymptotically converge to a compact set, ensuring e,W˜∈L∞. Since the reference model state xr is bounded and the unknown weight matrix W is constant, boundedness of system state vector x and adaptive weights Ŵ is guaranteed.

4.5 Stability results for structured uncertainty

Theorem 1.3. Consider the uncertain system dynamics given inEq. (4), uncertainty parametrization in Assumption 1.1, reference model inEq. (8), control input inEq. (12), filter states inEq. (19), and adaptive law inEq. (44). Then, the state tracking erroretis asymptotically stable. Furthermore, all the signals in a closed-loop system are bounded. If the basis functionϕis interval exciting, then the zero-solutioneW˜Δ˜f=0,0,0is asymptotically stable.

Proof: Recall the Lyapunov function in Theorem 1.2:

V=12eTPe+12trW˜TX−1W˜≤12λmaxP∥e∥2+121λminΓ+γms∥W˜∥2E50

with its time derivative being

V̇=−12λminQ∥e∥2−γ1∥W˜Tϕf∥2−γ2∥W˜∥∥W˜∥−d4E51

where d4=γ1ε0s+γmε0d3+γ2d1. With structured uncertainty parameterization, we have ε=εf=ϵ=0, yielding d4=γ2d1 and Δ˜f=W˜Tϕf. The discussions for boundedness of all system signals in Theorem 1.2 also hold in structured uncertainty cases with a smaller bound on W˜. It can also be shown that the error signals e,Δ˜f,W˜ are asymptotically stable by incorporating the results from Corollary 1.1. Specifically, upper bound d1 can be replaced with d2t, i.e., d4=γ2d2t where d2t→0 as t→∞ exponentially at a rate of β2. Thus,

V̇t→−12λminQ∥e∥2−γ1∥Δ˜f∥2−γ2∥W˜∥2<0,ast→∞E52

Therefore, the tracking error e low-frequency uncertainty estimation error Δ˜f, and weight estimation error W˜ asymptotically converge to zero, ensuring the asymptotic stability of the zero-solution eW˜Δ˜f=0,0,0.

4.6 Discussion

The proposed information recovery-based composite model reference adaptive controller (IR-CMRAC) acquires all the benefits of CMRAC as IR-CMRAC update law already contains the minimizing solution in Eq. (27). Additionally, the proposed method exhibits significant improvements as the following:

As the design parameter γm tends to zero, the learning rate X in Eq. (45) tend to Γ, furthermore, with γ2=0, the entire proposed modification term results in CMRAC law in Eq. (25). Hence, the proposed law can be considered as a generalized form of CMRAC.
Time derivative of gradient of the cost function in Eq. (27) is given by

ddt∇JŴ=2ϕ̇fϕfTW˜−ϕfϕfTŴ̇b−ϕfϕfTŴ̇mE53

It can be observed that the proposed update law in Eq. (44) contains the derivative information given in Eq. (53). Incorporating the derivative into traditional gradient descent-based optimization allows the shape of the transient behavior. In that respect, it can be considered analogous to the derivative action in traditional proportional/integral/derivative (PID) controllers.

Each term in Eq. (53) contributes to the adaptation performance in different manners. Specifically, the first term, ‘ϕ̇fΔ˜f’, compensates for the information lost while filtering the basis function ϕ by including the high-frequency content of basis ϕ. Recall that ϕ̇f is given by ϕ̇f=ωfϕ−ϕf; that is, ϕ̇f actually contains only the high-frequency components of basis ϕ. It should also be noted that the product of ϕ̇f and ϕf is still a high-frequency signal.
The second term in Eq. (53), ‘ϕfϕfTŴ̇b’, regulates the standard MRAC through gain γm. In this context, this term can be considered analogous to proportional feedback control for the standard adaptive law, acting akin to a stability augmentation system. Essentially, it entails readjusting the learning rate Γ to prevent adverse effects on adaptation from large values in the basis function ϕ.
The last term in Eq. (53), ‘ϕfϕfTŴ̇m’, regulates the learning rate of the modification term in a similar way to the standard adaptive law regulation explained in previous item.
Last but not the least, term ‘γ2Θ̂−Ŵ’ in Eq. (44) plays a vital role in ensuring the parameter convergence in structured uncertainty parameterization; i.e. Ŵt→W as t→∞. In that respect, this term acts like an integral action in a traditional PID controllers as it analogously allows to remove the steady state error in the weight estimations. Specifically, the auxiliary weight Θ̂ evolves in the direction of minimizing the steady-state error in the parameter estimation, and the term ‘γ2Θ̂−Ŵ’ ensures that the adaptive weights Ŵ stay bounded around Θ̂, which inherently results in zero steady-state error in W˜. For unstructured uncertainty parameterization, boundedness of adaptive weights around close neighborhood of the ideal parameters are achieved (instead of parameter convergence).

4.7 Inverse-free weight update law

One might observe that the updated law proposed in Eq. (44) necessitates computing the inverse of a (potentially large matrix) during online operations. This would undoubtedly incur additional computational costs and practical challenges. To address these drawbacks, we proceed to manipulate this term to achieve an inverse-free learning rate.

Lemma 1.2. Given thatΓ=ΓT≻0is positive definite matrix,γm∈R+is a positive constant scalar, andϕf∈Rs, the matrixΓ−1+γmϕfϕfTis invertible and its inverse is given by

Γ−1+γmϕfϕfT−1=Γ−11+trγmϕfϕfTΓΓγmϕfϕfTΓ≻0E54

Proof: Note that Γ=ΓT≻0 and γmϕfϕfT≽0. This fact implies Γ−1+γmϕfϕfT is also a symmetric positive definite matrix, i.e. Γ−1+γmϕfϕfT≻0. Then, Γ−1+γmϕfϕfT is always invertible. Let us define the following for clarity:

E≜γmϕfϕfT=ϕfγmϕfγmTG≜Γ−1→G−1=ΓE55

Then,

EG−1E=ϕfγmϕfγmTΓϕfγmϕfγmT=ϕfγmTΓϕfγmϕfγmϕfγmT=trϕfγmϕfγmTΓϕfγmϕfγmT=trEG−1EE56

Le G+E−1 be in the form of G+E−1=G−1−νG−1EG−1. Then, their product

G+EG−1−νG−1EG−1=I−νEG−1+EG−1−νEG−1EG−1E57

gives identity matrix if the equality νEG−1−EG−1+νEG−1EG−1=0 is satisfied. Then, combining this inequality with the result in Eq. (56) yields:

νEG−1−EG−1+νtrEG−1EG−1=ν−1+νtrEG−1EG−1=0E58

This equality holds for any square matrix E with rankE=1 and invertible matrix G if the parameter v is chosen as

ν=11+trEG−1=11+trγmϕfϕfTΓE59

Substituting v into G+E−1=G−1−νG−1EG−1 results in

Γ−1+γmϕfϕfT−1=Γ−11+trγmϕfϕfTΓγmΓϕfϕfTΓE60

That completes the proof. Readers may refer to [40] for details.

5. Illustrative numerical examples

5.1 Systems with integrator dynamics

In this example, the first order roll dynamics of an aircraft is considered:

ṗt=Lppt+Lδδat+ΔpE61

where pt is roll rate, δat is aileron input, dynamic stability derivative Lp denoting the roll damping is Lp=−1, control derivative is Lδ=1, and the uncertainty Δp is given by

Δp=0.2+p∣p∣E62

This numerical example can be regarded as an incorrect characterization of the roll mode time constant and the existence of constant wind disturbance trying to roll the aircraft. To improve the tracking performance robustness, an integrator state xi is augmented to the system as follows:

ṗtẋit=Lp0−10ptxit+Lδ0δat+Δp+01pctE63

where pc is the commanded roll rate.

It is important to note that standard MRAC suffers from degraded performance in systems with augmented integrator dynamics if the learning rate is relatively high. As clearly seen in Figure 1, increasing the learning rate induces high-frequency oscillations in the system. In this example, the efficacy of the proposed method will also be highlighted for the systems with augmented integrator dynamics.

Figure 1.
Performance comparison for standard MRAC in the presence of high gains.

The reference model is chosen as:

ẋrt=−2.53−10xrt+01pctE64

Thus, the nominal controller becomes:

unt=−1.5pt+3xitE65

5.1.1 Without integral action in the adaptation (γ2=0)

Numerical simulations are carried out with sampling frequency of 100 Hz, positive definite solution P is determined using Q=I2×2, and low-pass filter bandwidth is ωf=4 rad/s. In addition, adaptive gains are γ1=γc=γm=1, Γ=I2×2, ΓΘ=2I2×2, τw=20 s, and γ2=0. That is, the integral action in adaptation is not activated for results shown in Figures 2 and 3. It should be noted that the common adaptive gains are kept the same to make a fair comparison; that is, γ1=γc and Γ is constant for all simulations. Lastly, the commanded roll rate pc is a square wave with an amplitude of 1 rad/s and period of 10 s (see Figure 1).

Figure 2.
Tracking performance comparison of adaptive controllers (γ2=0 in IR-CMRAC).

Figure 3.
Weight estimation performance comparison of adaptive controllers (γ2=0 in IR-CMRAC).

In Figure 2, state tracking performance for different adaptive controllers is illustrated. As clearly seen, the nominal controller results in oscillatory roll response due to insufficient damping. Among all the adaptive controllers, the best tracking performance is achieved by the IR-CMRAC with almost zero state tracking error. Additionally, the desired tracking response is realized by the aileron control input with no oscillations.

Figure 3 indicates the evolution of the weight estimation error W˜. Although the integral action is disabled with γ2=0, the parameter convergence is achieved within 15 seconds. It should be noted that the exciting signal pct is persistently exciting; hence, the parameter convergence is an expected result. However, the important point is that the fastest convergence is achieved with IR-CMRAC as more information is included in the adaptation, which adds new directions to update the adaptive weights.

Figure 4 illustrates the evolution of time-varying learning rate X. One can observe that the variation of diagonal elements X11 and X22 is more apparent during the transients. This is because the stability augmentation system becomes active especially during the high-frequency variations in the signals. In this way, the undesired effects of high-frequency signals used in the adaptation are suppressed.

Figure 4.
Evolution of time-varying learning rates in IR-CMRAC (γ2=0 in IR-CMRAC).

5.1.2 With integral action in the adaptation (γ2=10)

In this scenario, the integral action in IR-CMRAC is activated by setting the learning rate γ2=10 in Eq. (44). The state tracking performance, evolution of time-varying learning rate X, and aileron control input are not illustrated for this case since the differences between simulation results with γ2=0 and γ2=10 are indistinguishable. However, the weight estimation performance is worth mentioning and presented in Figure 5. Unlike IR-CMRAC with γ2=0, the adaptive weights keep evolving with γ2=10 even if the systems states are not excited (for example on the interval of t∈510). This is because, by the time t=5 seconds, the basis function ϕ has become sufficiently exciting, thereby fulfilling Assumption 1.4. As a result, faster convergence of the adaptive weights is achieved, as claimed by Theorem 1.3.

Figure 5.
Weight estimation performance comparison of adaptive controllers.

5.2 Presence of measurement noise and external disturbance

In this example, an external disturbance is added to the uncertainty as:

Δpt=0.2+0.23e−0.13t+0.1sin0.2t+0.13sin0.37t+p∣p∣E66

Newly introduced sinusoidal signals can be considered as the effects of variable wind speed and direction, whereas the exponential term is used to simulate the wind shear. With this definition of aggregated uncertainty, the unknown weight matrix becomes time-varying:

Wt=0.2−0.23e−0.13t+0.1sin0.2t+0.13sin0.37t1,ϕp=1p∣p∣E67

Furthermore, the measurement noise is added to the roll rate p as

pt←pt+υtE68

where υt is the Gaussian white noise with standard deviation of 2 deg/s, which practically ensures υt∈−66deg/s, ∀t≥t0. All the simulation parameters are the same as in Section 5.1.1, except for the learning rate γ2=1.

Figure 6 illustrates the state tracking performance of the proposed adaptive controller for the outlined simulation scenario. Due to the low-pass filter nature of integration, the integrator state x2 is not affected drastically by the roll rate measurement noise. Furthermore, IR-CMRAC is mainly driven by the low-pass filtered signals (for example ϕf, Δ˜f, uf in Eq. (44)); the effects of measurement noise are suppressed successfully, thereby resulting in smooth aileron control input. This result can also be observed in the time-varying learning rate components Xij given in Figure 7.

Figure 6.
State tracking performance for IR-CMRAC in the presence of noise and disturbance.

Figure 7.
Variation of components of the learning rate X for IR-CMRAC.

It is important to remember that the convergence results of Theorem 1.3 are valid if the unknown weight matrix W is constant. However, it can readily be shown that all the system signals are uniformly ultimately bounded if the unknown weight matrix is bounded and time-varying, i.e. ∥Wt∥2≤δw with δw∈R+ being unknown upper bound. In Figure 8, the evolution of the adaptive weights is illustrated in the presence of external disturbance and measurement noise. As seen in the figure, the adaptive parameters stay bounded around their actual values and are practically noise-free.

Figure 8.
Evolution of the adaptive weights for IR-CMRAC in the presence of measurement noise and disturbance (dashed line: Actual weights W(t), solid line: Estimated weights Ŵt.)

5.3 Investigation of information recovery

In this example, we conduct numerical simulations to highlight the main contribution of the proposed method: information recovery. In order to make a fair comparison with CMRAC, the integral action in the adaptation is deactivated with γ2=0. All the other simulation parameters are as in Section 5.1.1. Measurement noise and disturbance are not applied to the system dynamics. Eventually, the following update laws are compared for the plant dynamics in Eq. (61) with the uncertainty in Eq. (62):

CMRACLaw:Ŵ̇=−ΓϕeTPB+Γγ1ϕfΔ˜fTIR‐CMRACLawγ2=0:Ŵ̇=−XϕeTPB+Xγ1ϕfΔ˜fT+γmϕ̇fΔ˜fTE69

Simulation results for several values of γm are shown in Figure 9. The lost information in CMRAC (due to low-pass filtering) is included in the adaptation with nonzero γm. Hence, the weight estimation error becomes smaller with increasing γm up to 1. However, further increase in γm results in overshoot and oscillatory response, causing degraded weight estimation performance. This can be considered analogous to the excessive derivative gain in a traditional PID controller, where the unnecessarily large derivative gains might cause instability. It is clear from the figure that judicious tuning of the gain γm results in significantly improved adaptive weight estimation and hence, the desired state tracking performance.

Figure 9.
The effect of γm on the weight estimation error for IR-CMRAC.

5.4 Adaptation with non-PE signals

The parameter convergence in both standard MRAC and CMRAC is dependent on the stringent requirement of persistent excitation of basis function ϕx. Note that PE condition of ϕx depends on the persistent excitation of system states x [41]. Furthermore, states x are persistently exciting provided that the exogenous reference input r(t) meets PE condition [4]. In this section, we provide simulation results with non-PE reference signal to illustrate the efficacy of the proposed method under interval excitation (IE) condition, which is much weaker than PE as described in Remark 1.1.

Figure 10 compares the state-tracking performances of MRAC, CMRAC, and IR-CMRAC. In the simulated flight case, the aircraft performs a right bank maneuver and returns back to wings-level flight condition exponentially.

Figure 10.
State tracking performance with non-PE reference signal.

As seen clearly from Figure 10, the reference signal is not persistently exciting but satisfies the interval excitation condition. The parameter convergence is achieved without persistent excitation, as suggested by Corollary 1.1 and Theorem 1.3 (see Figure 11).

Figure 11.
Weight estimation error with non-PE reference signal.

6. Software-in-the-loop (SITL) simulations for lateral flight control

6.1 X-plane flight simulator and lateral dynamics

X-plane flight simulator operates according to the blade element theory, where the physics engine calculates velocity components for each blade element. These components stem from various sources like rotational motion, free-stream velocity, propeller inflow, and downwash and wake caused by aerodynamic surfaces and fuselage. Thus, the aircraft model realized by the X-plane flight simulator is highly nonlinear and realistic, which makes X-plane a Federal Aviation Administration (FAA) certified flight simulator [42].

Linear equations of motion for the lateral dynamics of F-4 Phantom II at steady wings-level trim conditions with true airspeed of V∞=360 knots and altitude of h=24000 ft. are obtained using ‘Athena Vortex Lattice’ [43] as:

β̇ṗṙϕ̇=−1.470.0−1.00.075−18.85−7.522.890.015.150.56−7.610.00.01.00.00.0⏟≜Apβprϕ⏟≜xp+0.0−0.158.48−0.96−0.183.940.00.0⏟≜Bpδaδr⏟≜uyp=βϕ=10000001xp=EpxpE70

where β is sideslip angle [rad], p and r are roll and yaw rates rad/s (resp.), and ϕ is bank angle [rad]. Aileron and rudder inputs are normalized with maximum deflection of 20 degrees to be δa∈−11 and δr∈−11, respectively.

6.2 Nominal controller design

In order to increase the robustness and improve the tracking performance, integrator states are introduced as follows:

żt=βcmdtϕcmdt−βtϕtżt=rt−ypt=rt−EpxptE71

Then, augmented system dynamics can be expressed as follows:

ẋpż=Ap0−Ep0⏟≜Axpz⏟≜x+Bp0⏟≜Bδaδr+0I⏟≜Brβcmdϕcmdẋ=Ax+Bu+BrrE72

Reference model satisfying Ar=A−BK is given as

ẋr=Arxr+BrrE73

The nominal controller unt=−Kx is designed using feedback gain K:

K=1.141.15−0.814.14−3.25−3.02−0.58−0.690.09−1.324.760.97E74

The fastest mode is the roll subsidence with frequency of 7.66 rad/s. Hence, the cut-off frequency of the low-pass filter is chosen to be ωf=8rad/s.

6.3 SITL simulation results

In this section, we demonstrate SITL simulation results of the F-4 Phantom fighter jet’s lateral dynamics. The objective is to execute consecutive left and right bank maneuvers (see Figure 12) while ensuring minimal sideslip to uphold coordinated flight. Initially, adaptation remains inactive for 70 seconds, and then the proposed adaptive controller is activated. The communication between the controller and the X-plane flight simulator is established through the user datagram protocol (UDP) port. Hence, an unknown communication delay is present in the simulations.

Figure 12.
Left and right Bank Maneuvers as the flight scenario.

In Figure 13, state tracking performance is illustrated. As the proposed adaptation is activated, the sideslip angle gets smaller, resulting in better-coordinated bank maneuvers. Furthermore, the oscillations, especially in the roll channel, are significantly reduced with the adaptation.

Figure 13.
State tracking performance for SITL simulation with Γ=ΓΘ=2I, γm=10, τw=30, γ1=1, and γ2=10.

Figures 14 and 15 illustrate the commanded and actuated control surface commands. It is important to highlight that the improvements in state tracking performance are not because of the excessive use of the controls but due to the effective cancelation of the uncertainties. This is clear as the commanded control surface deflections significantly decrease with the activation of proposed adaptive controller.

Figure 14.
Aileron control for SITL simulation with Γ=ΓΘ=2I, γm=10, τw=30, γ1=1, and γ2=10.

Figure 15.
Rudder control for SITL simulation with Γ=ΓΘ=2I, γm=10, τw=30, γ1=1, and γ2=10.

Note that the uncertainty in SITL flight simulation is not guaranteed to be matched. That is, some portion of the uncertainty may not lie in the range space of input matrix B. Corresponding plant dynamics can be expressed as:

ẋ=Ax+Bu+Δx+DΔuxE75

with BTD=0 and Δux being unmatched uncertainty. In this case, the reference model state tracking cannot be achieved since it is not possible to suppress all the effects of uncertainties with the control input. Instead, controlled output tracking can be achieved by introducing the command governor controls for the unmatched uncertainties [27].

7. Conclusions

In this chapter, a new model reference adaptive control architecture is proposed to recover the lost information in filter-based adaptive controllers such as composite/combined MRAC. In CMRAC-like formulations, the uncertainty estimation is achieved by filtering the system dynamics, which significantly improves the transient performance. However, filtering the high-frequency content in the system signals may degrade the adaptation due to lost information. This adverse effect can become even more crucial if the low-pass filter bandwidth is remarkably small. With the proposed method, the high-frequency content of the uncertainty estimation is integrated in a frequency-selective manner into the adaptive law to recover the information lost during filtering. Furthermore, learning rates in the weight update law become time-varying so that the proposed modification term behaves as a stability augmentation system. In that regard, the bandwidth of the closed-loop system is adjusted online. The efficacy of the proposed solution is illustrated through numerical simulations of aircraft roll dynamics and software-in-the-loop simulations for lateral dynamics of F-4 Phantom II fighter aircraft through the X-plane flight simulator. Furthermore, rigorous stability proof is provided by Lyapunov’s stability theorem.

References

1. Boyd S, Sastry SS. Necessary and sufficient conditions for parameter convergence in adaptive control. Automatica. 1986;22(6):629-639
2. Ioannou PA, Kokotovic PV. Instability analysis and improvement of robustness of adaptive control. Automatica. 1984;20(5):583-594
3. Narendra KS, Annaswamy A. A new adaptive law for robust adaptation without persistent excitation. IEEE Transactions on Automatic Control. 1987;32(2):134-145
4. Pomet J-B, Praly L. Adaptive nonlinear regulation: Estimation from the Lyapunov equation. IEEE Transactions on Automatic Control. 1992;37(6):729-740
5. Nguyen N, Krishnakumar K, Boskovic J. An optimal control modification to model-reference adaptive control for fast adaptation. In: AIAA Guid. Navig. Control Conf. Honolulu, HI: AIAA; 2008
6. Calise AJ, Yucelen T, Muse JA, Yang B-J. A loop recovery method for adaptive control. In: AIAA Guid. Navig. Control Conf. Chicago, IL: AIAA; 2009
7. Yucelen T, Calise AJ. Kalman filter modification in adaptive control. Journal of Guidance, Control, and Dynamics. 2010;33(2):426-439
8. Slotine J-JE, Li W. Composite adaptive control of robot manipulators. Automatica. 1989;25(4):509-519
9. Volyanskyy K, Calise AJ, Yang B-J, Lavretsky E. An error minimization method in adaptive control. In: AIAA Guid. Navig. Control Conf. Keystone, CO: AIAA; 2006
10. Adetola V, Guay M. Performance improvement in adaptive control of linearly parameterized nonlinear systems. IEEE Transactions on Automatic Control. 2010;55(9):2182-2186
11. Yucelen T, De La Torre G, Johnson EN. Improving transient performance of adaptive control architectures using frequency-limited system error dynamics. International Journal of Control. 2014;87(11):2383-2397
12. Höcht L, Maity A, Holzapfel F. Frequency selective learning model reference adaptive control. IET Control Theory & Applications. 2015;9(15):2257-2265
13. Na J, Yang J, Ren X, Guo Y. Robust adaptive estimation of nonlinear system with time-varying parameters. International Journal of Adaptive Control and Signal Processing. 2015;29(8):1055-1072
14. Basu Roy S, Bhasin S. Novel model reference adaptive control architecture using semi-initial excitation-based switched parameter estimator. International Journal of Adaptive Control and Signal Processing. 2019;33(12):1759-1774
15. Pan Y, Sun T, Yu H. On parameter convergence in least squares identification and adaptive control. International Journal of Robust and Nonlinear Control. 2019;29(10):2898-2911
16. Glushchenko AI, Petrov VA, Lastochkin KA. Adaptive control system with a variable adjustment law gain based on the recursive least squares method. Automation and Remote Control. 2021;82(4):619-633
17. Yucelen T, Haddad WM. Low-frequency learning and fast adaptation in model reference adaptive control. IEEE Transactions on Automatic Control. 2012;58(4):1080-1085
18. Nguyen N, Burken J, Hanson C. Optimal control modification adaptive law with covariance adaptive gain adjustment and normalization. In: AIAA Guid. Navig. Control Conf. Portland, OR: AIAA; 2011. p. 6606
19. Yayla M, Kutay AT. A modification to adaptive control with time-varying learning rate for improved transient performance. In: AIAA SciTech Forum. San Diego, CA: AIAA; 2022
20. Lavretsky E. Combined/composite model reference adaptive control. IEEE Transactions on Automatic Control. 2009;54(11):2692-2697
21. Yang J, Na J, Gao G. Robust model reference adaptive control for transient performance enhancement. International Journal of Robust and Nonlinear Control. 2020;30(15):6207-6228
22. Lewis F, Jagannathan S, Yesildirek A. Neural Network Control of Robot Manipulators and Non-linear Systems. UK: CRC Press; 1998
23. Park J, Sandberg IW. Universal approximation using radial basis function networks. Neural Computation. 1991;3:246-257
24. Cybenko G. Approximation by superpositions of a sigmoidal function. Mathematics of Control, Signals and Systems. 1989;2(4):303-314
25. Liu Q, Li D, Ge SS, Guo Y. Adaptive feedforward rbf neural network control with the deterministic persistence of excitation. Neural Computing and Applications. 2021;33(24):17013-17028
26. Kim S-H, Lee H, Cho N, Kim Y. Data-efficient active weighting algorithm for composite adaptive control systems. IEEE Transactions on Automatic Control. 2022
27. Yayla M, Arabi E, Kutay AT, Yucelen T. Command governor-based adaptive control for dynamical systems with matched and unmatched uncertainties. International Journal of Adaptive Control and Signal Processing. 2018;32(8):1124-1144
28. Tao G. Adaptive Control Design and Analysis. New York, NY, USA: John Wiley & Sons, Inc.; 2003
29. Ioannou PA, Sun J. Robust adaptive control. In: Control Theory. PTR Prentice-Hall; 1996
30. Lavretsky E, Wise K. Robust and Adaptive Control: With Aerospace Applications. London: Advanced Textbooks in Control and Signal Processing, Springer; 2012
31. Khalil HK. Nonlinear Systems. New York, NY: Prentice Hall; 2002
32. Lavretsky E, Gibson TE. Projection operator in adaptive systems. arXiv preprint arXiv:1112.4232. 2011
33. Sun J. A modified model reference adaptive control scheme for improved transient performance. IEEE Transactions on Automatic Control. 1993;38(8):1255-1259
34. Gibson T, Annaswamy A, Lavretsky E. Improved transient response in adaptive control using projection algorithms and closed loop reference models. In: AIAA Guid. Navig. Control Conf. Minneapolis, MN: AIAA; 2012. p. 4775
35. Duarte MA, Narendra KS. Combined direct and indirect approach to adaptive control. IEEE Transactions on Automatic Control. 1989;34(10):1071-1075
36. Guo K, Pan Y. Composite adaptation and learning for robot control: A survey. Annual Reviews in Control. 2023;55:279-290
37. Cho N, Shin H-S, Kim Y, Tsourdos A. Composite model reference adaptive control with parameter convergence under finite excitation. IEEE Transactions on Automatic Control. 2017;63(3):811-818
38. Chowdhary G, Johnson E. A singular value maximizing data recording algorithm for concurrent learning. In: Proceedings of the American Control Conference. San Francisco, CA: IEEE; 2011. pp. 3547-3552
39. Yayla M, Kutay AT. Guaranteed exponential convergence without persistent excitation in adaptive control. In: UKACC 11th International Conference on Control. Belfast: IEEE; 2016. pp. 1-6
40. Miller KS. On the inverse of the sum of matrices. Mathematics Magazine. 1981;54(2):67-72
41. Gorinevsky D. On the persistency of excitation in radial basis function network identification of nonlinear systems. IEEE Transactions on Neural Networks. 1995;6(5):1237-1244
42. Laminar Research. X-plane: Flight simulator. X-Plane Version 11. 2022
43. Mark D, Harold Y. Athena vortex lattice. AVL 3.6. 2022.

[1] 1. Boyd S, Sastry SS. Necessary and sufficient conditions for parameter convergence in adaptive control. Automatica. 1986;22(6):629-639

[2] 2. Ioannou PA, Kokotovic PV. Instability analysis and improvement of robustness of adaptive control. Automatica. 1984;20(5):583-594

[3] 3. Narendra KS, Annaswamy A. A new adaptive law for robust adaptation without persistent excitation. IEEE Transactions on Automatic Control. 1987;32(2):134-145

[4] 4. Pomet J-B, Praly L. Adaptive nonlinear regulation: Estimation from the Lyapunov equation. IEEE Transactions on Automatic Control. 1992;37(6):729-740

[5] 5. Nguyen N, Krishnakumar K, Boskovic J. An optimal control modification to model-reference adaptive control for fast adaptation. In: AIAA Guid. Navig. Control Conf. Honolulu, HI: AIAA; 2008

[6] 6. Calise AJ, Yucelen T, Muse JA, Yang B-J. A loop recovery method for adaptive control. In: AIAA Guid. Navig. Control Conf. Chicago, IL: AIAA; 2009

[7] 7. Yucelen T, Calise AJ. Kalman filter modification in adaptive control. Journal of Guidance, Control, and Dynamics. 2010;33(2):426-439

[8] 8. Slotine J-JE, Li W. Composite adaptive control of robot manipulators. Automatica. 1989;25(4):509-519

[9] 9. Volyanskyy K, Calise AJ, Yang B-J, Lavretsky E. An error minimization method in adaptive control. In: AIAA Guid. Navig. Control Conf. Keystone, CO: AIAA; 2006

[10] 10. Adetola V, Guay M. Performance improvement in adaptive control of linearly parameterized nonlinear systems. IEEE Transactions on Automatic Control. 2010;55(9):2182-2186

[11] 11. Yucelen T, De La Torre G, Johnson EN. Improving transient performance of adaptive control architectures using frequency-limited system error dynamics. International Journal of Control. 2014;87(11):2383-2397

[12] 12. Höcht L, Maity A, Holzapfel F. Frequency selective learning model reference adaptive control. IET Control Theory & Applications. 2015;9(15):2257-2265

[13] 13. Na J, Yang J, Ren X, Guo Y. Robust adaptive estimation of nonlinear system with time-varying parameters. International Journal of Adaptive Control and Signal Processing. 2015;29(8):1055-1072

[14] 14. Basu Roy S, Bhasin S. Novel model reference adaptive control architecture using semi-initial excitation-based switched parameter estimator. International Journal of Adaptive Control and Signal Processing. 2019;33(12):1759-1774

[15] 15. Pan Y, Sun T, Yu H. On parameter convergence in least squares identification and adaptive control. International Journal of Robust and Nonlinear Control. 2019;29(10):2898-2911

[16] 16. Glushchenko AI, Petrov VA, Lastochkin KA. Adaptive control system with a variable adjustment law gain based on the recursive least squares method. Automation and Remote Control. 2021;82(4):619-633

[17] 17. Yucelen T, Haddad WM. Low-frequency learning and fast adaptation in model reference adaptive control. IEEE Transactions on Automatic Control. 2012;58(4):1080-1085

[18] 18. Nguyen N, Burken J, Hanson C. Optimal control modification adaptive law with covariance adaptive gain adjustment and normalization. In: AIAA Guid. Navig. Control Conf. Portland, OR: AIAA; 2011. p. 6606

[19] 19. Yayla M, Kutay AT. A modification to adaptive control with time-varying learning rate for improved transient performance. In: AIAA SciTech Forum. San Diego, CA: AIAA; 2022

[20] 20. Lavretsky E. Combined/composite model reference adaptive control. IEEE Transactions on Automatic Control. 2009;54(11):2692-2697

[21] 21. Yang J, Na J, Gao G. Robust model reference adaptive control for transient performance enhancement. International Journal of Robust and Nonlinear Control. 2020;30(15):6207-6228

[22] 22. Lewis F, Jagannathan S, Yesildirek A. Neural Network Control of Robot Manipulators and Non-linear Systems. UK: CRC Press; 1998

[23] 23. Park J, Sandberg IW. Universal approximation using radial basis function networks. Neural Computation. 1991;3:246-257

[24] 24. Cybenko G. Approximation by superpositions of a sigmoidal function. Mathematics of Control, Signals and Systems. 1989;2(4):303-314

[25] 25. Liu Q, Li D, Ge SS, Guo Y. Adaptive feedforward rbf neural network control with the deterministic persistence of excitation. Neural Computing and Applications. 2021;33(24):17013-17028

[26] 26. Kim S-H, Lee H, Cho N, Kim Y. Data-efficient active weighting algorithm for composite adaptive control systems. IEEE Transactions on Automatic Control. 2022

[27] 27. Yayla M, Arabi E, Kutay AT, Yucelen T. Command governor-based adaptive control for dynamical systems with matched and unmatched uncertainties. International Journal of Adaptive Control and Signal Processing. 2018;32(8):1124-1144

[28] 28. Tao G. Adaptive Control Design and Analysis. New York, NY, USA: John Wiley & Sons, Inc.; 2003

[29] 29. Ioannou PA, Sun J. Robust adaptive control. In: Control Theory. PTR Prentice-Hall; 1996

[30] 30. Lavretsky E, Wise K. Robust and Adaptive Control: With Aerospace Applications. London: Advanced Textbooks in Control and Signal Processing, Springer; 2012

[31] 31. Khalil HK. Nonlinear Systems. New York, NY: Prentice Hall; 2002

[32] 32. Lavretsky E, Gibson TE. Projection operator in adaptive systems. arXiv preprint arXiv:1112.4232. 2011

[33] 33. Sun J. A modified model reference adaptive control scheme for improved transient performance. IEEE Transactions on Automatic Control. 1993;38(8):1255-1259

[34] 34. Gibson T, Annaswamy A, Lavretsky E. Improved transient response in adaptive control using projection algorithms and closed loop reference models. In: AIAA Guid. Navig. Control Conf. Minneapolis, MN: AIAA; 2012. p. 4775

[35] 35. Duarte MA, Narendra KS. Combined direct and indirect approach to adaptive control. IEEE Transactions on Automatic Control. 1989;34(10):1071-1075

[36] 36. Guo K, Pan Y. Composite adaptation and learning for robot control: A survey. Annual Reviews in Control. 2023;55:279-290

[37] 37. Cho N, Shin H-S, Kim Y, Tsourdos A. Composite model reference adaptive control with parameter convergence under finite excitation. IEEE Transactions on Automatic Control. 2017;63(3):811-818

[38] 38. Chowdhary G, Johnson E. A singular value maximizing data recording algorithm for concurrent learning. In: Proceedings of the American Control Conference. San Francisco, CA: IEEE; 2011. pp. 3547-3552

[39] 39. Yayla M, Kutay AT. Guaranteed exponential convergence without persistent excitation in adaptive control. In: UKACC 11th International Conference on Control. Belfast: IEEE; 2016. pp. 1-6

[40] 40. Miller KS. On the inverse of the sum of matrices. Mathematics Magazine. 1981;54(2):67-72

[41] 41. Gorinevsky D. On the persistency of excitation in radial basis function network identification of nonlinear systems. IEEE Transactions on Neural Networks. 1995;6(5):1237-1244

[42] 42. Laminar Research. X-plane: Flight simulator. X-Plane Version 11. 2022

[43] 43. Mark D, Harold Y. Athena vortex lattice. AVL 3.6. 2022.

Information Recovery in Composite Model Reference Adaptive Control

Latest Adaptive Control Systems [Working Title]

Abstract

Keywords

Author Information

Metehan Yayla*

Ali Turker Kutay

1. Introduction

2. Notation and definitions

3. Problem definition

3.1 System description

3.2 Standard MRAC review

3.3 Composite/combined MRAC review

4. Main result

4.1 Integrated uncertainty estimation error

4.2 Auxiliary weight update law

4.3 Primary layer of adaptation

4.4 Stability results for unstructured uncertainty

4.5 Stability results for structured uncertainty

4.6 Discussion

4.7 Inverse-free weight update law

5. Illustrative numerical examples

5.1 Systems with integrator dynamics

Figure 1.

5.1.1 Without integral action in the adaptation (γ2=0)

Figure 2.

Figure 3.

Figure 4.

5.1.2 With integral action in the adaptation (γ2=10)

Figure 5.

5.2 Presence of measurement noise and external disturbance

Figure 6.

Figure 7.

Figure 8.

5.3 Investigation of information recovery

Figure 9.

5.4 Adaptation with non-PE signals

Figure 10.

Figure 11.

6. Software-in-the-loop (SITL) simulations for lateral flight control

6.1 X-plane flight simulator and lateral dynamics

6.2 Nominal controller design

6.3 SITL simulation results

Figure 12.

Figure 13.

Figure 14.

Figure 15.

7. Conclusions

References