Электронный журнал «Труды МАИ». Выпуск № 34
www.mai.ru/science/trudy/
Decentralized formation control considering information propagation
R. Funase, M. Arakawa, Y. Tsuda and J. Kawaguchi
Abstract
There is a growing interest in formation flight of satellites and its control strategy. The centralized control architecture is optimal when all information is shared by every member in the formation, however, it possibly shows unstable behavior in a realistic situation where the information exchange cost (for example, communication load) is large and the information can be only partially shared. This paper proposes a new decentralized control strategy for multi-vehicle formation control, which effectively propagates the information between adjacent agents and uses the information in the control calculation. The strategy can overcome the drawback of the centralized approach while maintaining the communication load very low. The effectiveness of the proposed strategy is numerically verified by applying to one-dimensional car train control problem and two-dimensional in-plane satellite formation control problem.
1. Introduction
Recently the formation flying has become very common and its keeping strategies are now of great interest in Astrodynamics these days. It may be simply true that the optimal station keeping is realized only when every spacecraft information is shared by every member in the formation. This is the case called "full informed" state and the strategy taken results in a centralized control. However, in case the number of members constituting a formation becomes infinitely large, the communication load of the system becomes divergent and the states uncertainty may deform the relative geometry before the entire information is exchanged.
Instead of this centralized control law, it is practical to apply a localized control law that requests only the information of relative motion among a few spacecraft adjacent to each other. In this case, the strategy will be a decentralized one. In the decentralized control, as the information is not shared by every member, the optimal control is impossible and non-optimal transition behavior is observed. A typical example of this non-optimal transition behavior is the traffic jam phenomenon in highways on the ground. When the head car of platoon rises/slowdowns in the speed, oscillatory phenomenon of the car density like compressional wave is observed, and in the
worst case, the platoon shows divergent behavior. This is because that the member cannot use the full information of the platoon or formation. The partially informed control makes the transition behavior not uniformly converged to the goal and the settle time increases. This problem is also observed in the satellite formation control and some appropriate control strategy should be implemented.
This paper first deals with the formation behavior via z-transformation method. The z-transformation expression can easily express the transition behavior of the partially informed formation. A uniformly convergent strategy is proposed in the z-transformed formulation, which physically indicates that the information of the relative motion and the resulted control input is propagated from one agent to the adjacent one. This information propagation scheme was proved to stabilize the formation behavior.
This section deals with the control of a car platoon as a typical example of one-dimensional formation control.
Fig. 1 shows the typical one-dimensional formation. A specific position of each vehicle is denoted as xi and an acceleration input to the each vehicle is denoted as ui. That is:
2. One-dimensional platoon control
2.1 Mathematical formulation via z-transform
X = U;
(1)
where, usingyi as a distance between vehicles, if the definition is given as follows:
y = -
(2)
and thus
(3)
X
n+2 n+1
n
n-1 n-2
Fig. 1: One-dimensional formation
In usual traffic, the driver usually inputs the control of inter-vehicle distance, which results in
yi+(01yi=(01yi_!
U =®2(y -y)
to maintain the
(4)
This relation very simply indicates how the traffic flow is destabilized. The foregoing cars naturally have same characteristic frequency and the following cars must suffer from the oscillatory reference input, to which the acceleration resonances. Since most of drivers have very similar characteristic frequency, this resonant tendency governs and make the cars flow density amplitude increased seriously.
This is further z-transformed spatially to give a result expressed as follows:
Jz=-uz (5)
z
The z-operator included in the right-hand side indicates a delayed transmission, and it is essential to offset and delete the delayed transmission from the viewpoint of control logic in order to provide a stable vehicle train control. Although this has been also previously reported, a novel interpretation thereof will be herein added. A first consideration is directed to the case where a driver attempts to control a distance between his/her vehicle and a vehicle traveling ahead thereof. In this case, assuming that a speed error feedback gain and an inter-vehicle distance error feedback gain are denoted as Kd and Kp, respectively, then the control logic is expressed to be:
u = -Kd (x, - x) + Kp (J, - y,) (6)
or, when z-transformed,
U =-Kd(xz -\) + Kp(yz -yz) (7)
It is to be noticed that in this expression, a variable added with a lateral bar on top indicates a target value for the variable. Since from the following expressions:
yz =X (8)
z
*.=r~y. (9)
1 — z
are given (attention should be pain on the minus sign), a closed-loop characteristic follows
yz =-Kd {yz -X}-KpZ-1 {yz - yz} (10)
Herein, a simple interpretation of the above characteristic is given. Without loosing generality, ~ is assumed, where, Kd is sufficiently small and replaced as p , then a
solution under an initial condition of yzo is given in
y = (yz0 - y }c°s
'P& ;
+ y (11)
If the above equation is expanded by (1/z) and resultant coefficients are evaluated, then the inter-vehicle distance can be determined. A first term gives a propagation of a difference between an initial condition and a target of control to a vehicle train, wherein the n-th inter-vehicle distance from the head of the train can be given by a expansion factor of (1/z)n , and in this simplified model, it may be given in
±(I«,'J cos(»-f) 02)
Theoretically, the phase should be different by 90 degrees for every inter-vehicle distance. This amplitude may be:
a. = . (2 «V J 03)
<W ^ o
And if within an finite time, a. , thus, the inter-vehicle distance would not diverge. Contrary, for a finite train length, the inter-vehicle distance would diverge over time. In a sequence of inter-vehicle distance, ultimately, the amplitude causes a vibration expressed as:
0 - y,\[ 1 t j (14)
to be produced in the automobiles in the downstream train, which is further superposed, leading to the traffic congestion. Fig. 2 shows the example of such divergent behavior.
It could be said that in order to avoid the congestion, effectively Q<<1 should be given to reduce sensitivity for triggering the acceleration or deceleration operation in response to the inter-vehicle distance, though it could not be essentially avoided that the vibration propagated divergently to the automobiles in the downstream train would be induced. Consequently, if any disturbance in the inter-vehicle distance occurs, then the congestion should be induced without exception in the conventional method, in which the control is given by the driver who judges a distance with respect to the automobile ahead.
resonance phenomenon 450 -1-1-1-1-r
time[s]
Fig. 2: Divergent behavior of car train
2.2 Extension to a continuous system
To understand the dynamics of the system more deeply, the differential equation for discrete integer i (i=0,1,...) is extended to any continuous i between 0 and 1. By eq.(3) and (6), the discrete expression is
y^-KJ-K^y-y, ,) (15)
where, it is assumed that ^ and are same for all i. If the variable i takes a continuous value between 0 and 1, this equation can be rewritten as
y^-Kpiy^y^-Kjy,
= -Kp^f--Kdy (16)
J__i_L yt
n n "
So, the continuous expression can be obtained as
y=~Kp^-Kdy (17)
If the general solution of eq. (17) is assumed to be
y, =a(t)P(i) (18)
the following differential equation can be obtained by eq. (17) and (18):
a a B'
?L + Kda=_K p_ = A (const) (19)
a a p
Therefore, by introducing a constant □, the following two governing equations are obtained:
a + Kda-Aa = 0 (20)
—
P' + — P = 0 (21)
Kp
Clearly, when □ > 0, □ is unstable and will diverge. As □ is determined by initial condition, eq.(20) states that the time history of the inter-vehicle distance can be unstable depending on the initial condition. Eq.(21) shows that the inter-vehicle distance can be propagated along the platoon. This phenomenon is interpreted as the traffic jam.
2.3 Control law with information propagation structure
In this section, a control law is proposed which has an information propagation scheme
between adjacent vehicles.
By eq.(7) and (8), the conventional control law is rewritten as
- 1 - z
U =-Kd(Xz -Xz) +-Kp(Xz -Xz) (22)
z
Here, we add a new term into the above control law to obtain a new control law as
U =-Kd (Xz - X ) + — Kp (Xz - Xz) - - Kp (xz - xz) (23 )
z z
We define the additional term -1K (xz - xz) as wi. wi can be interpreted as follows:
z
~-Kp{x2 -x2) = -K^-x^) = Kp{yi_x — JJ. j) — Kp(xj 2 -x,_2) □ (24)
Then, the control law can be divided into these two equations
U = -Kd (X, - X,) + Kp (y - y,) + (25)
^ = ^-i + Kp(y-i -y-i) (26)
where, wi can be considered as information propagated from foregoing car. And the dynamics of the system is given by
yz =-Kd {yz -yz}-Kp{yz -yz} (27)
or
yt =-Kd {y, -y,}-Kp{y, -y,} (28)
It can be said that by accumulating and propagating the information wi, the dynamics which was affected by the foregoing car (see eq.(4)) becomes independent of i, and the behavior of the inter-vehicle distance can be stable.
By extending eq.(28) to a continuous expression, we can obtain
y=-Kp(yi-yi)-KJi (29)
which does not contain the partial derivative with respect to i. This means that the dynamics is independent in the along-track direction. The time response can be stable by appropriately selecting Kp and Kd.
2.4 Effect of information propagation delay
The previous section assumed that there is no information propagation delay in the system. This section introduces the effect of information propagation delay into the system dynamics and investigates the resulting dynamics.
Instead of the propagation scheme eq.(26), the following scheme is assumed:
w,. = wi_1(t— At) + Kp (yi_1(t) — yj (30)
where At is the delay time between adjacent vehicles. By accumulating this equation,
,—i
w.
Kp (yk (t — k At ) — yk ) (31)
k=0
So, the control law (eq.(25)) becomes
_ i—1 U =—Kd ( X, (t) — X, ) + Kp (y. (t) — yt ) + ^ Kp (yk (t — kAt) — yk ) (32)
ykK
k=0
By substituting this equation into eq.(3), we can obtain the Laplace-transformed expression as
7 —1 1
s2Y = KP-Y — y,) — KdsYi + Kp -e-Ats Y — y,) (33)
z z
dY
Then, by using Y_i ~ Y — Y', Y — Y-i = —L (n ^(X>), continuous expression is obtained as:
8i
r}Y
[s2 + Kds + Kpe-Ats ](Y — yt) + Kp (1 — e~Ms) ^ = 0 (34)
The inverse Laplace transform of this equation leads to the following time-domain expression:
m + Kdm + Kp(yXt-At)-yi) + Kp^-Kp^t^ = 0 (35)
Here, we assume that At is small and yt{t - At)« yt{t) -yt{t)At,
m + iK.-AtKp^ + K.iy^-y^ + AtKp ^ = 0 (36)
is obtained.
When the following general solution is assumed and substituted into eq.(36),
y=a(t)P(i) (37)
we can obtain the equation of a and P as
à \adt r'
- + {KJ-MKII) + KI±-= -A tKpZ- = A (const) (38)
a ff a F p
Therefore, by introducing a constant X, the following two governing equations are obtained:
À + (Kd-AtKp-Â)À + KpA = 0 (39)
X
P' =--—P (40)
P AtKpP ( )
where A = jadt. It should be noted that X if At ^ 0 (see eq.(38)). The general solution of eq.(36) is written as follows:
y(t) = A(t )P(i) + y (41)
By comparing eq.(39) and (29), the propagation delay t can make the system unstable and the traffic jam can occur. By also comparing eq.(40) and eq.(21), the tendency of how the system becomes unstable can be controlled by the value Dt. That is, the new control law with information propagation structure can suppress the traffic jam by selecting appropriate Dt, while simple PD control (conventional control) can never.
2.5 Numerical simulation results Simulation results as discrete system
The performance of the conventional control law and the new one is evaluated by numerical simulations. The system is assumed to be discrete system (section 2.1).
Fig. 3 and 4 show the traffic control example. Distance between 1st and 2nd cars abruptly extends to 160m and the distance between 5th and 6th cars suddenly becomes 150m. Here is assumed the driving skill is common. Fig. 4 shows the result obtained by the usual control law, eq. (13). Rear cars' distances diverge gradually. Fig. 5 shows the result obtained by the new control law shown in eq. (21). As this figure shows, the Dramatic improvement is obtained. Small localized oscillation is found but is damped quickly.
E
^ 200
>
ID
0
J 150
w 73 JD
.2 100 -C
<D >
1
(D +J
C
50
A /?
\ I
/ Y\ 7>*
i ik^^ 1 1
4 6
i
10
200
180
i—i E 1_1 160
> 0 140
o
c <0 120
w
"O 100
0
o V 80
0
> 1 60
0
+J 40
20
0
Fig. 3: Control result by conventional control law
0s
10s
20s
30s
40s
50s
60s
70s
10
Fig. 4: Control result by new control law
Simulation results as continuous system
Next, simulations are performed as a continuous system, and the effect of information propagation which is discussed in section 2.4 is evaluated.
Fig. 5 and 6 shows the comparison of traffic control for a continuous system, where no information propagation delay is applied. The feedback gain Kp and Kd was 0.5/100, 1.0 [m/s2]
0
0
2
8
0
2
4
6
8
respectively. The initial disturbance condition was such that the inter-vehicle distance 0.99 was applied at i=0.5. The horizontal axis in the figure shows continuous i, and the vertical axis is the transient time [s]. In Fig. 5, divergent behavior is observed from 20 sec and the behavior propagates to i=0.4-0.7 at 60 sec, so the conventional control could not suppress the divergence. On the other hand, in Fig. 6, the new control law with information propagation structure could suppress the initial disturbance within 10 seconds.
Then, the effect of the information propagation delay is imposed. Fig. 7 and Fig. 8 shows the simulation results of new control law when the propagation delay is 1sec and 2sec respectively. In Fig. 7, the initial disturbance is suppressed by 60 seconds. However, in Fig. 8, the initial disturbance is growing. So, the new control law cannot overcome the time delay of 2 seconds.
Fig. 5: Control results of conventional control law (continuous system)
Fig. 6: Control results of new control law (continuous system)
Fig. 7: Control results of new control law (At=1 [s])
Fig. 8: Control results of new control law (Àt=2 [s])
3. Application to satellite formation control
In this section, the proposed control law with information propagation scheme is applied to satellite formation control problem.
3.1 Mathematical formulation
Here, we consider two-dimensional (in-plane) formation keeping problem around a circular reference orbit.
The equation of relative motion is described by the two-dimensional Hill's equation:
x - 2 ny - 3 n2x = u , s
' (42)
y + 2m = v
where n denotes to the orbital angular velocity of the reference circular trajectory and x and y is the relative position of a satellite in the anti-nadir and along-track direction respectively.
When a contraction of x - x> y -y/2 is used, and the rotation coordinate transformation of
p -
cosO - sinOV x ' ^ V y ' y
vsin0 cosO y
(43)
is introduced, the equations of motion is altered to the following time-varying expression.
p + A(6)p =
C cosO
' 1 — v
V 2 y
sin# cosO
- U
(44)
where
f
A(0) - n
3 sin 0 cosO 1 - 3cos2 0 -1 + 3sin2 0 -3 sin 0 cos 0
(45)
and 9 is the true anomaly of the reference trajectory. The p-coordinate is rotating in reverse direction of the reference orbit as shown in Fig. 9.
Fig. 9: Coordinate transformation In this coordinate, it is clear that a trivial solution, a constant vector always satisfies the equation above. This corresponds to the state in which the formation is frozen along similar ellipse trajectories, i.e. like cart wheel orbits. This section deals with a station-keeping problem in this coordinate.
The relative motion in the coordinate can be expressed as:
¿Spv+Ap» =U,
i-W-i Ujj
iûi-1 - u^ V vj-i - Vj y
(46)
where Ap. - p^ - p. . This can be converted via z-transformation as:
+ Ap__ =
± -1
0--1
z„
U
(47)
By applying the new control law proposed in the previous section,
u. =
^ -1
0
1 y
(icp + kp)
This is decomposed by the inverse z-transform as:
u = cos da + sin 0Bm
n * m
1 v = - sin da + cos dBm
^ n ' m
ccn=ccn_x+cAin+kAi;n
Pm=Pm i +cAnm+knm
(49)
where an and pm are the value to be propagated between adjacent satellites.
3.2 Numerical simulation results
Fig. 10 shows the trajectories of four satellites that are initially positioned at the origin (reference trajectory). The target position of each satellite in the □□□ coordinate is (1,1), (1,2), (2,2) and (4,2) respectively. The new control scheme guarantees uniform convergence to the target point and the entire formation is controlled.
Fig. 10: Results of 2-dimensional formation control
4. Conclusion
This paper proposed a decentralized control law for multi-vehicle formation. The proposed method has an information propagation structure between adjacent agents, so that the transient divergent behavior due to the partially informed situation was completely suppressed. The proposed method was found to have a drawback that the transient behavior of the formation can diverge when there is large information propagation delay, however, the divergent property can be controlled by appropriately selecting the feedback control gains.
This method was applied to one-dimensional car train control problem and two-dimensional satellite formation control problem, and the effectiveness was verified by numerical simulations.
e treated with hods, the search will be executed by Heuristic Methods, mainly Genetic Algorithms (GA). thatellite with respect
References
1. Y. Zhang, Elias B. Kosmatopoulos and Petros A. Ioannou, "Autonomous Intelligent Cruise Control Using Front and Back Information for Tight Vehicle Following Maneuvers," IEEE Transactions on vehicular technology, Vol 48, No.1, 1999
2. M. E. Khatir and E. J. Davison, "A Decentralized Lateral-Longitudinal Controller for a Platoon of Vehicles Operating on a Plane," American Control Conference, 2006
Authors
R. Funase, Institude of Space and Astronautical Science (ISAS), Japan Aerospace Exploration Agency (JAXA), Japan. Contacts: funase@isas.jaxa.jp
Y. Tsuda, Institude of Space and Astronautical Science (ISAS), Japan Aerospace Exploration Agency (JAXA), Japan.
J. Kawaguchi, Institude of Space and Astronautical Science (ISAS), Japan Aerospace Exploration Agency (JAXA), Japan.
M. Arakawa, Tokyo Denki University, Japan. Address: 3-1-1 Yoshinodai, Sagamihara 229-8510, Japan.