Russian Journal of Nonlinear Dynamics, 2023, vol. 19, no. 4, pp. 633-646. Full-texts are available at http://nd.ics.org.ru DOI: 10.20537/nd231212
NONLINEAR ENGINEERING AND ROBOTICS
MSC 2010: 93C10, 93C40, 68T40
Enhanced Adaptive Control over Robotic Systems via Generalized Momentum Dynamic Extensions
S. Nedelchev, L. Kozlov, R. Khusainov, I. Gaponov
Adaptive control and parameter estimation have been widely employed in robotics to deal with parametric uncertainty. However, these techniques may suffer from parameter drift, dependence on acceleration estimates and conservative requirements for system excitation. To overcome these limitations, composite adaptation laws can be used. In this paper, we propose an enhanced composite adaptive control approach for robotic systems that exploits the acceleration-free momentum dynamics and regressor extensions to offer faster parameter and tracking convergence while relaxing excitation conditions and providing a clear physical interpretation. The effectiveness of the proposed approach is validated through experimental evaluation on a 3-DoF robotic leg.
Keywords: adaptive control, parameter estimation, motion control
1. Introduction
Model-based approaches have been widely used in the development of robotic systems to formulate provably stable algorithms for manipulation, locomotion, and force rendering. However,
Received November 22, 2023 Accepted December 15, 2023
This research was supported by the Russian Science Foundation (RSF) and Ministries of the Republic of Tatarstan (grant agreement 22-21-20108).
Simeon Nedelchev
s.nedelchev@innopolis.university
Lev Kozlov
l.kozlov@innopolis.university Ramil Khusainov r.khusainov@innopolis.ru
Center for Technologies in Robotics and Mechatronics Components, Innopolis University ul. Universitetskaya 1, Innopolis, 420500 Russia
Igor Gaponov i.gaponov@ucl.ac.uk
Department of Computer Science, University College London, London, United Kingdom Gower St, London, WC1E 6BT UK
these algorithms highly depend on the accuracy of parameter estimates, such as the inertia tensor, mass, and friction coefficients. In real-life scenarios, these parameters may not be accurately known in advance or change over time due to various factors. To address this, researchers have explored adaptive control and online parameter estimation techniques that update controller parameters in real-time based on system feedback.
The field of adaptive control and estimation has been extensively studied for several decades, with significant progress made by the robotics community in the 1980s and 90s. One of the most notable contributions was the passivity-based controller proposed by Slotine and Li [17]. However, this celebrated controller in general does not yield parameter convergence, which results in numerical drift, sensitivity to noise, and dependence on persistency of excitation (PE). To overcome these issues, researchers combined the tracking-based adaptive law with online parameter estimation techniques, proposing the composite adaptive control framework [11, 16]. Although composite adaptation significantly improves parameter error dynamics, the convergence still depends upon PE. One way to combat this is to increase the amount of data present in the underlying model through regression extension, which was first proposed by Lion [15] and Kreisselmeier [13]. This idea has been exploited in various scenarios, leading to the framework of concurrent learning [23, 31]. The attractive properties of such estimators are that the norm of the parameter error is monotonically decreasing with convergence supported by a condition strictly weaker than PE, namely, that of square integrability of eigenvalues of extended regres-sor matrices [14, 29, 30]. Thanks to the above properties, these estimators have been used in safety-critical controllers [10, 33].
The aforementioned estimation techniques rely on the assumption that some of the system outputs can be represented linearly with respect to the set of constant parameters. In robotics, this linear representation is known as the regressor form [18] and is the standard in identification [24]. However, as most models of robotic systems are given by second-order differential equations, the underlying linear representation is dependent upon accelerations, which are usually subject to significant noise. To overcome this limitation, several techniques have been proposed, such as filtering based on integration by parts [25] or exploitation of linearly parameterized physical quantities as mechanical power and energy [20, 21]. However, the resulting scalar power or energy regressors may require more data due to the "mixing" of the different components of the dynamics. Meanwhile, alternative acceleration-free models have been effectively exploited in the context of disturbance observers [6, 7]. Relying on the filtered model of generalized momentum, this technique exploits the Hamiltonian view of dynamics, facilitating the transformation of second-order dynamics into pairs of first-order governing equations. However, to the best of our knowledge, this technique has not yet been studied in the context of adaptation.
In this paper, we propose a composite adaptation approach that effectively combines momentum dynamics with filtered regressor extension. This approach allows for faster parameter and tracking convergence while relaxing the excitation conditions, as demonstrated through analytical analysis and experimental evaluation.
The remainder of this paper is organized as follows. In the next section, we provide a short background on models of robotic systems, passivity-based adaptive control and its composite modification. In Section 3, the momentum-based extended regressors are proposed and integrated into the framework of composite adaptation, whose convergence is proven via the Lyapunov direct method. In Section 4, we provide an experimental evaluation of the proposed adaptation law on a practical 3-DoF robotic manipulator (a leg of a commercial quadruped robot) and compare the performance of the proposed controller with a conventional scheme. Finally, we present a relevant discussion, conclusions, and outline avenues for future work.
2. Linear parameterization and adaptive control
In this section, we will briefly review the basic concepts of dynamics, linear parameterization, conventional passivity-based adaptive control, and composite adaptation, which will facilitate the understanding of the proposed scheme.
2.1. Dynamics and linear parameterization
The dynamics of a robotic system can be described by the equations of motion which relate the forces acting on the system to the second-order derivatives of coordinates q G Rra:
M(q)q + C(q, q)q + g(q) = u. (2.1)
Here, u G Rra represents a vector of control inputs, M G Rraxra is a positive definite symmetric inertia matrix, C G Rraxra is a matrix of Coriolis and centrifugal terms, and g G Rra describes the effect of gravity and other position-dependent (potential) forces.
The above equation of motion is typically obtained using the Lagrangian formalism:
d (ВС\ ВС .
where L(q, q, t) = K(q, q) — U(q) is the Lagrangian, which is equal to the difference of kinetic and potential energy of the system.
We focus on linearly parameterized systems, which have kinetic and potential energies expressed as linear functions ф and unknown but constant parameters в G R1:
K = vK(q, q)в, U = vU(q)e. (2.3)
Here, (q, q) G Rl and фи(q) G Rl are kinetic and potential energy regressors expressed in generalized coordinates.
Continuing with the Euler - Lagrange equations, one can obtain the dynamics i = 1, ..., n in a linearly parameterized form:
*(q, q, q)e = M(q, e)q + C(q, «q, e)q + g(q, в) = u, (2.4)
where the dynamics regressor Ф G Rrax^ consists of rows calculated as
Ф.-
у д'2Ук » ! у д'2Ук ■ дУк ! д(Ри
f^d^dqj 3 j^dq.dq. э Qq. dq.
(2.5)
This representation is widely used in robotics to develop various control schemes. One such scheme is the widely used passivity-based adaptive controller, originally proposed by Slotine and Li [17].
2.2. Passivity-based adaptive control
The passivity-based adaptive controller is given by the following expressions:
u = Yc(q, q, q„ qs)0 - Ks, (2.6)
0 = -yY (t)s. (2.7)
Here, s = q + Aq is the so-called sliding surface with A > 0 regulating the convergence of the trajectories. The tracking error between the actual q and the desired trajectory qd is defined via q = q — qd, and qs = qd — Aq is referred to as the "virtual velocity". The "controller regressor" Yc(q, q, qs, qs) is calculated in a way that allows for the linear parameterization of the virtual dynamics Yc(q, q, qs, qj6 = M(q)qs + C(q, q)qs + g(q), where Yc > 0 is the adaptation parameter (similar to a gradient descent step) that regulates the convergence of the parameter estimates, and K >- 0 regulates the speed of convergence to the sliding surface.
The convergence is proven via Lyapunov's direct method for the following Lyapunov candidate function:
V(t) = i (sTM(q)s + 7-^0), (2.8)
where 6 = 6 — 6 is referred to as the parameter estimation error.
Taking the time derivative of the equation above and substituting the controller and adaptation law dynamics yields:
V(t) = (VYc + 6 — sTKs = —sTKs < 0. (2.9)
Invoking Barballat's lemma implies that convergence to the surface s = 0 is achieved as t ^ <x>, which in turn ensures that q and q tend to 0 as well.
However, it is important to note that the adaptive controller may not necessarily result in precise estimation of the unknown parameters, generating instead the values that ensure only s ^ 0. Convergence of parameter estimates is supported by the concept of persistence of excitation (PE), which requires that the bounded signal Y(t) G Rrax^ satisfies the following:
t+T
J YT(t)Y(t) dr h cl (2.10)
t
for some T > 0, c > 0 and all t> 0.
However, this condition is rarely satisfied in practice, leading to numerical drift of parameter estimates and potential instability [12, 29]. This has motivated researchers to enhance this class of adaptive controllers, with one of the most effective techniques being the composite adaptation.
2.3. Prediction error and composite adaptation
The concept of composite adaptation encompasses the integration of a "prediction regression" within a linearly parameterized model:
y(t) = y(t) — y(t) = Y(t)0 — Y(t)6 = Y(t)0. (2.11)
Here, y represents the predicted system outputs, while the discrepancy between the prediction y and the measured output y is referred to as the "prediction error", denoted by y.
It is important to note that the same parameters 6 are utilized to parameterize the robot in the conventional adaptation scheme (2.7). However, the regressor matrices Y and Yc differ. The signals y(t) and Y(t) are highly dependent on the application and system properties. They typically represent a combination of measurable terms that describe various physical phenomena expressed in a linearly parameterizable form, with some common types briefly discussed at the beginning of the next section. Importantly, these signals are not contingent upon the specific
implementation of the controller but rather rely on the assumption that the control input is known or measurable. In the subsequent section, we will delve further into constructing one of the realizations of y(t) and Y(t) using dynamically extended generalized momentum regression. In the meantime, let us continue with a brief overview of composite adaptation.
Regarding the specific linear parameterization, the adaptation law is modified by incorporating the prediction error term with a gain parameter jp > 0, resulting in:
6 = —YcYT s — YpYTy. (2.12)
Substituting this adaptive law into the derivative of the Lyapunov candidate (2.9) yields:
V(t) = -sTKs - ^QTYTYQ < 0. (2.13)
Yc
The dynamics of the parameter estimates are governed by the following law:
0 = —ypYt (t)Y(t)6 — YcYT (t)s(t). (2.14)
This adaptation law ensures time-variant first-order error dynamics with a converging input z(t) = Y'T(t)s(t). Consequently, parameter estimates exhibit smoother and faster convergence than in the case of the conventional adaptive control thanks to the "feedback term" —yYt(t)Y(t) ^ 0. The conventional adaptation algorithm is restored by setting the composite gain yp = 0, thus indicating that composite adaptation only enhances convergence and smoothness of estimate transients. However, using the composite adaptive law raises two issues that need to be addressed, namely: choosing an accurate, noise-free linear parameterization y = Y6, and secondly, reducing excitation requirements to improve parameter convergence properties in the absence of PE. These will be studied in the next section.
3. Momentum extensions and enhanced composite adaptation
Composite adaptive control has shown promise in improving parameter estimates and trajectory tracking, but its implementation requires an additional linear parameterization. However, selecting an appropriate linear model for a general robotic system is ambiguous. The naive approach of using the full linear representation of dynamics (2.11) is challenging because it relies on accelerations q. Although several filtering techniques have been proposed in the literature [12, 25, 30], they are computationally expensive for complex systems, since integration by parts of (2.1) is required. Another alternative is linear parameterization of energy or power [20, 21], but these regressors demand more excitation and further constrain the already restrictive PE conditions, which are seldom satisfied in practice.
Alternatively, generalized momentum models have shown success in acceleration-free disturbance observers [6, 7]. As we will further show, these momentum models, coupled with regressor extensions [29], enhance the quality of composite adaptive control.
3.1. Extended momentum regressors
To build the linear prediction model, we use the notion of generalized momentum p G Rra and its linear parameterization given by
P = ^ = M(q)q = Me = d>p(q, q)0, (3.1)
where G Rraxi is the generalized momentum regressor.
Taking the time derivative of the Euler - Lagrange equations yields:
d£ dq
U
dq dq
T
We can now define the matrix $h(q, q)
" d_
dt.
^Pjc
¿>q
q) -$h(q, q)
9<fiu
¿>q
e
e + u.
T
to write: u.
(3.2)
(3.3)
Now, similarly to the design of disturbance observers [6] we define the dynamics of prediction error via momentum residual:
d£ dq
u
d* * — q> — <J>,
dt * *
e u.
(3.4)
Next, we proceed with using the above in the context of filtered dynamic regressor extension — a particular technique that will alleviate the acceleration requirements and increase the dimension of regression.
Fig. 1. A schematic overview of regressor computation based on the momentum models
3.2. Acceleration-free dynamical extensions
The concept of extensions is a valuable tool for constructing high-dimensional regression models as follows:
Ye = col{YT, ... YrT}, ye = col{y1,..., yr}, (3.5)
where Ye G Rnr*1 and ye G Rnr are the extended regressors and predictions, respectively, r G N is referred to as the order of extension, while the col{x1, ..., xn} stacks up its vector arguments xk into a column vector x = [x1, ..., xn].
Various methods such as time delays or LTI filters can be used to construct the extended regressor [29]. For the proposed momentum-based models, integration can provide an acceleration-free family of regressors, but often leads to errors and drift. Alternatively, filtering techniques can be used to address these issues [30], with the stability of such a scheme investigated recently in [28].
To realize the extension filtering-based momentum framework, we apply a collection of firstorder filters with transfer functions H^s) = -J^-; to (3.3). The resulting extended regressors Y^ and predictions yi are given by
Yi + QiYi = f3i[&p(t) - *h(t)],
yi + ayi = fiu(t),
(3.6)
(3.7)
where ai > 0, ¡3i > 0 are time constants and gains of the filters, respectively. The solution of the above yields the extended regression of (3.5) and ensures that the accumulation of errors is avoided thanks to the exponential forgetting of initial conditions.
It is worth noting that the use of the filtering momentum residual makes it possible to avoid the differentiation of the momentum regressor To achieve this, we introduce the residual
momentum regressor *i(t) = Yi(t) — (3&p(t) and form a simple LTI ODE that takes the position and speed-dependent momentum parameterization as input and produces a regressor as output (Fig. 1):
* i + a^ = — (t) + $h(t)j. (3.8)
Solving the above with *i(t0) = —ft$p(t0) yields a solution equivalent to (3.6). Importantly, all calculations are based on the position and velocity q, q which we assume to be measurable, and no acceleration q is required. Moreover, the output is propagated through first-order stable filters, which further reduces the effect of noise.
We will now move on to discuss the proposed enhancement over adaptive controllers that leverages the idea of generalized momentum and regressor extensions.
3.3. Momentum enhanced composite adaptive controller
In this work, we propose an enhancement for composite adaptive controllers that leverages the generalized momentum and regressor extensions. Specifically, we enhance the composite version of the passivity-based adaptive controller by replacing the constant gain sliding feedback Ks with a varying adaptive momentum-like term M(q, y)s = A$p(q, s)°, and consequently use the extensions (3.6), (3.7) in the composite adaptive law. The resulting controller is given by
u = [Yc(q, q, qs, qs) — A$p(q, s)]°, (3.9)
r
0 = —7c[Yc — A$p(q, s)]Ts — YTyi. (3.10)
i=1
The stability and convergence analysis of the proposed scheme follows a Lyapunov candidate (2.8) whose derivative is
F(t) < —AsTM(q)s — 7P^min(t)yTy, (3.11)
where amin is the minimal singular value of extended regressor Ye.
Application of the Gronwall-Bellman inequality yields the following implication:
t
V(t) < e-2XtV(0) + y e-2X(t-T)e(t)\\0(r)||2 dr, (3.12)
0
where e(t) = ay-1 — yp^min(t).
The expression above, together with the results presented in [28, 29], suggests that the proposed controller and adaptive law exhibits several distinguishing properties:
• global asymptotic convergence of \\s\\ and \\y\\ is guaranteed, provided that amin ^ L2;
• global exponential convergence of \\s\\ and \\y\\ is ensured if o"min(t) is uniformly strictly positive or PE;
• once convergence to the sliding surface is achieved, the dynamics of parameter error norm \\y\\ is monotonically decreasing, i.e.: \\y(t2)\\ ^ \iy(t1)\, Vt2, t1 such that t2 > t1 > 0.
The properties described above share similarities with those of dynamically extended parameter estimators [28, 29] and composite/concurrent adaptation controllers [16, 31, 34]. However, the proposed scheme is applicable to a general type of robotic systems described by (2.1). Although a detailed analysis of these properties is beyond the scope of this paper due to space limitations, we provide some insights into their origins.
The asymptotic convergence of s is ensured by conventional controllers, and hence we do not elaborate on it further. On the other hand, the convergence of parameter estimates is guaranteed if the integral term in (3.12) can be made to approach zero as t ^ <x>. Invoking the mean value theorem shows that this can be achieved if amin ^ L2, which is weaker than persistent excitation on Y. Exponential convergence, however, relies on the fact that, if o"min(t) is either PE or uniformly positive definite, one can choose jp such that A < Ydp°"min(t), making the integral part uniformly negative. This, in turn, causes V(t) in (3.12) to be upper-bounded by an exponential curve e-2XtV(0). The monotonic decrease after convergence to the sliding surface is demonstrated by setting s = 0 in the definition of the Lyapunov candidate (2.8) and
its derivative (3.11). This results in ^ ^JYc^vamiYl(t)\d\ and, since y7c7p<Tmin(i) ^ 0,
|y(t)| monotonically decreases.
It is worth noting that these properties cannot be satisfied by conventional adaptive laws, as they require the prediction regressor to have full rank at least on some intervals, and thus o"min(t) = 0. This is only possible with an extended regression of order nr ^ l.
Having discussed the properties of the proposed scheme, we now proceed with the implementation and experimental evaluation on a practical robotic system.
4. Experimental evaluation
This section evaluates the performance of the proposed enhanced adaptive controller, as given by equations (3.9) and (3.10), and compares it with that of a conventional adaptive controller, given by equations (2.6) and (2.7).
Fig. 2. Photo of A1 robot with an overlaid schematic representation of the leg
The experimental framework for this investigation centered on a single limb from the Unitree A1 quadruped, as delineated in Figure 2. The leg is actuated by three identical motors with a maximum torque of 33.5 Nm and a rotation speed of 21 rad/sec. Each motor is equipped with 12-bit encoders and torque (current) sensing capabilities. The robot is equipped with a versatile API that allows real-time control over torques at a rate of 500-1000 Hz through Ethernet communication. It is imperative to highlight that the study was conducted using the
ö
0 ■ i—i
la
1
o ü
CD œ O
a o
!h ■
PL,
-Qi -<72 Is ----Id ----qr
1.5 Time t (s)
Fig. 3. Comparison of the trajectory tracking behavior of the conventional (top) and proposed (bottom) adaptive controllers
leg exclusively as a three-degree-of-freedom (3-DOF) manipulator, with the quadruped's base firmly fixed, thereby concentrating solely on the dynamics of the robotic manipulator.
4.1. Regressor models
To implement the conventional and proposed adaptive schemes, we first built a simplified model of the manipulator by assuming that the unknown parameters of each body are given by the following vector:
mixc. , miVc. , mizc. , 1xx. , fyy-, 1 zz-
l I I I Wl I
T
G R7,
where mi is the mass of the body and rc =
T
is the location of its center of mass
xc-, Vc-, zc-
t III
(COM) with respect to the body frame. Note that we assume the COM to be initially unknown and located not in the center of the frame. Ixxi, Iyy , Izz. are the diagonal components of the inertia tensor.
The potential energy regressors are given by
= gTRi(q) gT'ri(q) Ü!
T
x3
T
where ri £ R3 is the Cartesian position of the origin of the local frame connected to the fth body, Ri £ SO(3) is the rotation matrix describing the orientation of the body with respect to the world frame, and g = [0, 0, 9.81]T is the spatial gravity acceleration.
9
The kinetic energy regressors of each body are given by
=
—W v + W V
—WX, vy, + Wy, vx,
~'tUry> I ^^y ■ I 0
2W x-
2" Zi
1, ,2 1, ,2
1, ,2
2
Wx .v^ — vx
where vi(q, q) and wi(q, q) are the linear and angular velocities of the fth body, respectively, expressed in the inertial frame.
The kinematic quantities were computed as functions of joint coordinates and velocities using the pinocchio C++ library [36]. The complete dynamics, as given by equation (2.5), and momentum regressors were obtained via fast analytical derivatives with respect to q and q. Thanks to the efficient implementation of the pinocchio library, all necessary computations were completed within 1 ms, ensuring a 1 kHz sampling rate for the practical implementation of the controller.
The momentum regressors were extended with order r = 8, while gains and filter coefficients were uniformly distributed in the intervals ¡3i G [6, 10], ai G [8, 12], respectively. The resulting LTI systems, given by equations (3.7) and (3.8), were discretized exactly to provide an accurate recursive implementation without solving underlying ODEs numerically.
To obtain the relative parameters, which is important for conditioning the adaptation problem, we scaled the regressors with nominal parameters provided by the manufacturer as Ydiag{#0}. We also identified a set of basic lumped parameters °0 and their values via singular-value decomposition over sampled regressors and well-known least squares-based techniques reported in [18, 24]. Additionally, we identified the parameters of a simple viscous and dry friction model that was compensated for via a feed-forward term.
4.2. Tuning and comparison
Both controllers were given a task to track desired trajectories given by a critically damped reference model with a natural frequency w > 0:
qd + 2w(qd— qr) + w2 (qd— qr) = qr(t).
To reduce the computational burden, we discretize underlying ODEs of the reference model similarly to the LTI filters used in extensions. The reference trajectory is given by a harmonic signal:
qr = A0 + Ai sin 2nvit,
i i
where the amplitudes are chosen to account for the joint limits as A0 = [—0.1, 0.64, —1] and A = = [0.3, 1, —0.57], while the frequencies are given by v = [0.95, 0.66, 1.05].
To ensure a fair comparison between the passivity-based and proposed controllers, both were tuned with the same adaptation yc = 0.5 and sliding gain A = 7, respectively. The feedback gains in the conventional controller represented the average scaled nominal inertia along the desired
01
— e
1.5
CÔ
§ 1.0
î î î Î
r\y \l r n T
"iuvT \ ------t....
jl 1 ^^--t----
:-!-!-!-!-:
4 6
Time f (s)
10
Fig. 4. The parameter convergence under conventional (top) and proposed (bottom) adaptive laws
trajectory, K = AM. The gain for the composite adaptation was chosen as jp = 0.5, and the reference model was tuned with a natural frequency of w = 8 rad/s.
The experimental joint trajectories for both controllers are depicted in Fig. 3. The proposed controller outperformed the conventional, with a transient time of around 0.5 s, which is roughly 3 times faster than the passivity-based control. Sample temporal plots of dynamical parameter estimates (masses) are depicted in Fig. 4, with the dashed lines representing the identified parameters 00 (ground truth). The proposed adaptation law showed superior performance in terms of convergence rate, accuracy of estimates, and their smoothness, thanks to the power of composite adaptation provided by extended regression. The estimates rapidly converged to the region near the ground truth under the proposed adaptation, while the conventional estimator introduced visible oscillations and static errors. Increasing the learning gain yc in the conventional controller yielded parameter drift and instability. However, it should be noted that there are still small estimation errors in the proposed algorithm, which can be attributed to unaccounted dynamics, such as stick-slip friction and neglected off-diagonal inertia terms.
The overall performance of the controllers as summarized by the norms of |q| and |<?| is demonstrated in Fig. 5. Note how the proposed law ensures faster convergence for both parameters and tracking errors, with a monotonic, exponential-like behavior in the case of errors. It is worth mentioning that the proposed controller is also easier to tune, only requiring positive values for A, yc, and jp, all of which can be chosen to be much greater than the coefficients of the conventional controller, without compromising stability and noise rejection.
5. Conclusion
In this paper, we propose a modification to composite adaptive control for robotic systems. The proposed framework utilizes dynamically extended momentum dynamics similar to those used in designing disturbance observers. We analyze the conditions for exponential and asymptotic stability and demonstrate that the proposed modification offers faster parameter and
Conventional - Proposed
S 0.2
0.0 2.0 iSL 1.0
0.0
0 1 2 3 4 5
Time t (s)
Fig. 5. The summarized tracking (top) and parameter estimation (bottom) behavior under proposed and conventional controllers
tracking convergence, while also relaxing excitation requirements. These properties are similar to those of dynamically extended parameter estimators and composite/concurrent adaptation controllers, but can now be applied to general cases of robotic systems. The effectiveness of the proposed approach is demonstrated through experimental evaluation on a 3-DoF robotic leg, and the results demonstrate significant improvement over the conventional scheme, with 3 times faster convergence of tracking errors and an exponential-like convergence of parameter estimates.
In the future, we aim to enhance convergence behavior by using advanced mixing techniques. We also plan to account for parametric constraints via natural adaptive laws and validate the proposed techniques on various practical systems, including legged robots and quadrotor UAVs.
Conflict of interest
The authors declare that they have no conflict of interest.
References
[1] Prufer, M., Schmidt, C., and Wahl, F., Identification of Robot Dynamics with Differential and Integral Models: A Comparison, in Proc. of the IEEE Internat. Conf. on Robotics and Automation (San Diego, Calif, USA, 1994): Vol. 1, pp. 340-345.
[2] Annaswamy, A.M. and Fradkov, A.L., A Historical Perspective of Adaptive Control and Learning, Annu. Rev. Control, 2021, vol. 52, pp. 18-41.
[3] Li, W. and Slotine, J.-J.E., On-Line Parameter Estimation for Robot Manipulators, in Proc. of the IEEE Internat. Conf. on Systems, Man, and Cybernetics (Beijing, China, Aug 1988), pp. 353-356.
[4] Gautier, M. and Khalil, W., On the Identification of the Inertial Parameters of Robots, in Proc. of the 27th IEEE Conf. on Decision and Control (Austin, Tex., USA, 1988): Vol. 3, pp. 2264-2269.
[5] Gautier, M., A Comparison of Filtered Models for Dynamic Identification of Robots, in Proc. of the 35th IEEE Conf. on Decision and Control (Kobe, Japan, Dec 1996): Vol. 1, pp. 875-880.
[6] De Luca, A. and Mattone, R., Actuator Failure Detection and Isolation Using Generalized Momenta, in Proc. of the IEEE Internat. Conf. on Robotics and Automation (Taipei, Taiwan, Sep 2003): Vol. 1, pp. 634-639 (Cat. No. 03CH37422).
[7] Garofalo, G., Mansfeld, N., Jankowski, J., and Ott, Ch., Sliding Mode Momentum Observers for Estimation of External Torques and Joint Acceleration, in Proc. of the Internat. Conf. on Robotics and Automation (ICRA, Montreal, QC, Canada, May 2019), pp. 6117-6123.
[8] Le Tien, L., Albu-Schaffer, A., De Luca, A., and Hirzinger, G., Friction Observer and Compensation for Control of Robots with Joint Torque Measurement, in Proc. of the IEEE/RSJ Internat. Conf. on Intelligent Robots and Systems (Nice, France, Sep 2008), pp. 3789-3795.
[9] Parikh, A., Kamalapurkar, R., and Dixon, W. E., Integral Concurrent Learning: Adaptive Control with Parameter Convergence Using Finite Excitation, Int. J. Adapt. Control Signal Process., 2019, vol. 33, no. 12, pp. 1775-1787.
[10] Isaly, A., Patil, O.S., Sanfelice, R. G., and Dixon, W.E., Adaptive Safety with Multiple Barrier Functions Using Integral Concurrent Learning, in American Control Conf. (ACC, New Orleans, La., USA, May 2021), pp. 3719-3724.
[11] Slotine, J.-J.E. and Li, W., Composite Adaptive Control of Robot Manipulators, Automatica J. IFAC, 1989, vol. 25, no. 4, pp. 509-519.
[12] Slotine, J.-J. E. and Li, W., Applied Nonlinear Control, Englewood Cliffs, N.J.: Prentice Hall, 1991.
[13] Kreisselmeier, G., Adaptive Observers with Exponential Rate of Convergence, IEEE Trans. Autom. Control, 1977, vol. 22, no. 1, pp. 2-8.
[14] Gerasimov, D. and Nikiforov, V., On Key Properties of the Lion's and Kreisselmeier's Adaptation Algorithms, IF A C-PapersOnLine, 2020, vol. 53, no. 2, pp. 3773-3778.
[15] Lion, P. M., Rapid Identification of Linear and Nonlinear Systems, AIAA J., 1967, vol. 5, no. 10, pp. 1835-1842.
[16] Shao, X., Hu, Q., Li, D., Shi, Y., and Yi, B., Composite Adaptive Control for Anti-Unwinding Attitude Maneuvers: An Exponential Stability Result without Persistent Excitation, IEEE Trans. Aerosp. Electron. Syst., 2022, vol. 59, no. 2, pp. 1051-1066.
[17] Slotine, J.-J.E. and Li, W., On the Adaptive Control of Robot Manipulators, Int. J. Rob. Res., 1987, vol. 6, no. 3, pp. 49-59.
[18] An, Ch.H., Atkeson, Ch. G., and Hollerbach, J. M., Estimation of Inertial Parameters of Rigid Body Links of Manipulators, in Proc. of the 24th IEEE Conf. on Decision and Control (Fort Lauderdale, Fla., USA, Dec 1985), pp. 990-995.
[19] Craig, J., Hsu, P., and Sastry, S., Adaptive Control of Mechanical Manipulators, in Int. J. Rob. Res., 1987, vol. 6, no. 2, pp. 16-28.
[20] Gautier, M., Dynamic Identification of Robots with Power Model, in Proc. of the Internat. Conf. on Robotics and Automation (Albuquerque, N.M., USA, Apr 1997): Vol. 3, pp. 1922-1927.
[21] Gautier, M., A Comparison of Filtered Models for Dynamic Identification of Robots, in Proc. of the 35th IEEE Conf. on Decision and Control (Kobe, Japan, Dec 1996): Vol. 1, pp. 875-880.
[22] Chowdhary, G. and Johnson, E., Concurrent Learning for Convergence in Adaptive Control without Persistency of Excitation, in Proc. of the 49th IEEE Conf. on Decision and Control (CDC, Atlanta, Ga., USA, Dec 2010), pp. 3674-3679.
[23] Pan, Y. and Yu, H., Composite Learning from Adaptive Dynamic Surface Control, in IEEE Trans. Autom. Control, 2016, vol. 61, no. 9, pp. 2603-2609.
[24] Khalil, W. and Dombre, E., Modeling, Identification and Control of Robots, Boca Raton, Fla.: CRC, 2002.
[25] Hsu, P., Sastry, S., Bodson, M., and Paden, B., Adaptive Identification and Control for Manipulators without Using Joint Accelerations, in Proc. of the IEEE Internat. Conf. on Robotics and Automation (Raleigh, N.C., USA): Vol. 4, pp. 1210-1215.
[26] Gerasimov, D. N., Nikiforov, V. O., and Belyaev, M. E., Performance Improvement of MRAC by Dynamic Regressor Extension, in Proc. of the IEEE Conf. on Decision and Control (CDC, Miami, Fla., USA, Dec 2018), pp. 3032-3037.
[27] Gerasimov, D.N. and Nikiforov, V. O., Augmented Error Based Adaptive Control with Improved Parametric Convergence, IFAC-PapersOnLine, 2022, vol. 55, no. 12, pp. 67-78.
[28] Yi, B. and Ortega, R., Conditions for Convergence of Dynamic Regressor Extension and Mixing Parameter Estimators Using LTI Filters, IEEE Trans. Automat. Control, 2023, vol. 68, no. 2, pp. 1253-1258.
[29] Ortega, R., Nikiforov, V., and Gerasimov, D., On Modified Parameter Estimators for Identification and Adaptive Control. A Unified Framework and Some New Schemes, Annu. Rev. Control, 2020, vol. 50, pp. 278-293.
[30] Astolfi, A., Karagiannis, D., and Ortega, R., Nonlinear and Adaptive Control with Applications, London: Springer, 2008.
[31] Chowdhary, G., Yucelen, T., Muhlegg, M., and Johnson, E. N., Concurrent Learning Adaptive Control of Linear Systems with Exponentially Convergent Bounds, Internat. J. Adapt. Control Signal Process, 2013, vol. 27, no. 4, pp. 280-301.
[32] Lavretsky, E., Combined/Composite Model Reference Adaptive Control, IEEE Trans. Automat. Control, 2009, vol. 54, no. 11, pp. 2692-2697.
[33] Cohen, M.H. and Belta, C., Modular Adaptive Safety-Critical Control, Proc. of the American Control Conf. (ACC, San Diego, Calif., USA, 2023), pp. 2969-2974.
[34] Zhang, Z., Liu, F., Liu, T., Qiu, J., and Buss, M., A Persistent-Excitation-Free Method for System Disturbance Estimation Using Concurrent Learning, IEEE Trans. Circuits Syst. I Regul. Pap., 2023, vol. 70, no. 8, pp. 3305-3315.
[35] Hu, Q., Shao, X., and Guo, L., Composite Learning Pose Control of Spacecraft with Guaranteed Parameter Convergence, in Intelligent Autonomous Control of Spacecraft with Multiple Constraints, Singapore: Springer, 2023, pp. 267-296.
[36] Carpentier, J., Saurel, G., Buondonno, G., Mirabel, J., Lamiraux, F., Stasse, O., and Mansard, N., The Pinocchio C++ Library: A Fast and Flexible Implementation of Rigid Body Dynamics Algorithms and Their Analytical Derivatives, in Proc. of the IEEE/SICE Internat. Symp. on System Integration (SII, Paris, France, 2019), pp. 614-619.
[37] Andersson, J. A. E., Gillis, J., Horn, G., Rawlings, J.B., and Diehl, M., CasADi: A Software Framework for Nonlinear Optimization and Optimal Control, Math. Prog. Comp., 2019, vol. 11, no. 1, pp. 1-36.