Научная статья на тему 'DIFFERENTIAL GAME WITH A LIFELINE FOR THE INERTIAL MOVEMENTS OF PLAYERS'

DIFFERENTIAL GAME WITH A LIFELINE FOR THE INERTIAL MOVEMENTS OF PLAYERS Текст научной статьи по специальности «Математика»

CC BY
25
5
i Надоели баннеры? Вы всегда можете отключить рекламу.
Журнал
Ural Mathematical Journal
Scopus
ВАК
Область наук
Ключевые слова
DIFFERENTIAL GAME / PURSUIT / EVASION / ACCELERATION / STRATEGY / ATTAINABILITY DOMAIN / LIFELINE

Аннотация научной статьи по математике, автор научной работы — Samatov Bahrom T., Soyibboev Ulmasjon B.

In this paper, we study the well-known problem of Isaacs called the “Lifeline” game when movements of players occur by acceleration vectors, that is, by inertia in Euclidean space. To solve this problem, we investigate the dynamics of the attainability domain of an evader through finding solvability conditions of the pursuit-evasion problems in favor of a pursuer or an evader. Here a pursuit problem is solved by a parallel pursuit strategy. To solve an evasion problem, we propose a strategy for the evader and show that the evasion is possible from given initial positions of players. Note that this work develops and continues studies of Isaacs, Petrosjan, Pshenichnii, Azamov, and others performed for the case of players’ movements without inertia.

i Надоели баннеры? Вы всегда можете отключить рекламу.
iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.
i Надоели баннеры? Вы всегда можете отключить рекламу.

Текст научной работы на тему «DIFFERENTIAL GAME WITH A LIFELINE FOR THE INERTIAL MOVEMENTS OF PLAYERS»

URAL MATHEMATICAL JOURNAL, Vol. 7, No. 2, 2021, pp. 94-109

DOI: 10.15826/umj.2021.2.007

DIFFERENTIAL GAME WITH A LIFELINE FOR THE INERTIAL MOVEMENTS OF PLAYERS

Bahrom T. Samatov^, Ulmasjon B. Soyibboev^

Namangan State University, 316 Uychi Str., Namangan, 116019, Uzbekistan

i samatov57@inbox.ru, ^ ulmasjonsoyibboev@gmail.com

Abstract: In this paper, we study the well-known problem of Isaacs called the "Lifeline" game when movements of players occur by acceleration vectors, that is, by inertia in Euclidean space. To solve this problem, we investigate the dynamics of the attainability domain of an evader through finding solvability conditions of the pursuit—evasion problems in favor of a pursuer or an evader. Here a pursuit problem is solved by a parallel pursuit strategy. To solve an evasion problem, we propose a strategy for the evader and show that the evasion is possible from given initial positions of players. Note that this work develops and continues studies of Isaacs, Petrosjan, Pshenichnii, Azamov, and others performed for the case of players' movements without inertia.

Keywords: Differential game, Pursuit, Evasion, Acceleration, Strategy, Attainability domain, Lifeline.

1. Introduction

Differential game theory deals with conflict problems in systems expressed by differential equations. As a result of developing Pontryagin's maximum principle, it became apparent that there was a link between optimal control theory and differential games. Actually, problems of differential games describe a generalization of optimal control problems in cases where more than one player is involved.

The study of differential games was initiated by American mathematician R. Isaacs. His research was published in the form of a monograph [20] in 1965, in which a great number of examples were considered, and theoretical questions were only affected. Differential games have been one of the basic research fields since 1960, and their fundamental results were gained by Pontryagin [29], Krasovskii [23], Bercovitz [5], Dar'in and Kurzhanskii [9], Elliot and Kalton [10], Isaacs [20], Fleming [11], Friedman [12], Hajek [14], Ho, Bryson and Baron [15], Petrosjan [28], Pshenichnii [30, 31], Subbotin [40, 41], Ushakov [42], Chikrii [8], and others.

The monograph of Isaacs [20] includes certain game problems that were discussed in detail and put forward for further study. One of these problems is called the "Lifeline" problem, which was initially formulated and solved for certain special cases [20, Problem 9.5.1]. A simplified analytical solution to this problem in the half-plane was proposed by Isaacs in [20]. For the case when controls of both players are subjected to geometric constraints, this game was rather thoroughly considered by Petrosjan [28] based on approximating measurable controls with the most efficient piecewise constant controls that realize the parallel convergence strategy. Later, this approach to control in differential pursuit games was termed the n-strategy. The strategy proposed in [28], for a simple pursuit game with geometric constraints, became the starting point for the development of the pursuit method in games with multiple pursuers [6, 13, 39]. In 1986, a simplified analytical solution of the "Lifeline" problem was suggested by A. Azamov [3]. Later on, for the cases when controls of both players are subjected to integral, Gronwall or mixed constraints, this game was investigated in the works of Samatov [4, 32-37].

Based on the fundamental approaches in the theory of differential games developed by Pontrya-gin [29] and Krasovskii [23], a differential game is considered as a control problem from the point of view of either pursuer or evader. According to this view, the game reduces to either pursuit (convergence) problem or evasion (escape) problem. The main technique for solving the pursuit and evasion problems is constructing optimal strategies of players and defining the value of the game. The works [16-19, 21] are devoted to studying differential games of simple motion and by optimal strategies of players, it was proved that the value of the game exists.

In [24, 25], the classic time-optimal differential games with a lifeline were investigated. The first player seeks to lead the system to a given closed terminal set with a smooth boundary, and the second player strives to guide the same system to another given set whose boundary is also smooth and is also called a lifeline. To solve this problem, the authors adhered to the formalization of positional differential games proposed by N.N. Krasovskii and A.I. Subbotin.

In the present paper, we discuss the pursuit-evasion problems and the "Lifeline" game for the inertial movements of players. We impose geometric constraints on controls of the players. In order to solve a pursuit problem, we suggest the n-strategy for the pursuer and prove that this is an optimal strategy. After that, necessary and sufficient condition of pursuit is originated and optimal pursuit time (guaranteed pursuit time) is determined. To solve an evasion problem, we propose a strategy for the evader and show that the evasion is possible from given initial positions of the players. Here, any closed set given in the space is considered as a lifeline. In this case, the first player (a pursuer) aims to coincide with the second player (an evader) as quickly as possible and, by this occurrence, a trajectory of the evader shouldn't intersect the lifeline. The aim of the evader is to reach the lifeline by the time of the coincidence or is not to encounter the pursuer during the game. To solve the "Lifeline" problem, conditions of monotone embedding in respect to time for an attainability domain are given. In this paper, the statement and solution method of a differential game with a lifeline differ significantly from those from the works [24, 25]. Results of the paper rely on the works [1, 7, 22, 26-29, 42, 43] and adjoin the works [2-4, 20, 38, 41].

Assume that in the space Rn a controlled object P, called a pursuer, chases another object E, called an evader. Denote by x a state of the pursuer and by y that of the evader in Rn.

Let the motion dynamics of the players be generated by the following differential equations and initial conditions respectively:

where x, y, x0, y0, xi, yi, u, v € Rn, n > 2; x0 and y0 are initial states of the players, and xi and yi are their initial velocity vectors, respectively. We suppose that x0 = y0 and xi = yi.

Here the temporal variation of u must be a measurable function u(-) : [0, œ) ^ Rn, and we impose a geometrical constraint on this vector-function (briefly, G-constraint) in the form

where a is a given positive parametric number, which means the maximal acceleration value of the pursuer.

In a similar way, the temporal variation of v must be a measurable function v(-) : [0, oo) ^ Rn, and we impose on this vector-function a G-constraint in the form

2. Statement of problem

P : x = u, x(0) = x0, x(0) = xi, E : y = v, y(0) = yo, y(0) = yi,

(2.1) (2.2)

|u(t)| < a almost everywhere t > 0,

(2.3)

|v(t)| < almost everywhere t > 0,

(2.4)

where P is a given non-negative parametric number, which means the maximal acceleration value of the evader.

The control parameters of the players are the acceleration vectors u and v, which depend on time t > 0. We denote by U and V the set of all control parameters u and v satisfying conditions (2.3) and (2.4), respectively.

Definition 1. We call the measurable function u(-) (v(-)) that satisfies the condition (2.3) (the condition (2.4)) an admissible control of the pursuer (the evader) of the class U (the class V), where the pair of classes of admissible controls introduced (U, V) defines a differential game.

By means of the equations (2.1) and (2.2), each triplet (x0,x1,u(-)) and (y0,y1,v(-)) generates the trajectories of motion

t

x(t) = x0 + x1t + J (t — s)u(s)ds, (2.5)

0

t

y(t) = y0 + y1t + y"(t — s)v(s)ds (2.6)

0

of the pursuer and evader respectively.

Suppose that a closed subset M called a lifeline is given in the space Rn. The main goal of a pursuer P is to catch an evader E, that is, to achieve the equality x(t*) = y(t*) at some time t*, t* > 0, while the evader remains in the zone Rn \ M. The goal of the evader is to reach the zone M before being caught by the pursuer or to maintain the relation x(t) = y(t) for all t, t > 0. We should note that M does not restrict the motion of the pursuer. Further, it is assumed that the initial states x0 and y0 are given under the conditions x0 = y0 and y0 € M.

It is known that control functions depending only on the time parameter t, t > 0,, are insufficient for the pursuer to solve the pursuit problem, and suitable types of control must be strategies. There are several methods for defining such a concept. Below we will give some concepts to define.

First, we introduce the following notation:

z(t) = x(t) — y(t), Z0 = X0 — y0, ¿(0) = X1 — y1.

Definition 2. A function u : Rn x Sp ^ Sa is called a strategy of the pursuer if the following conditions are valid.

(a) u(z0, v) is a Borel measurable function in v, v € V.

(b) Admissibility: The inclusion u(z0, v(-)) € U holds for each v(-) € V on some time interval [0, t]. In this case, the function u(z0, v(-)) is called a realization of the .strategy u(z0, v).

(c) Volterra property: If v1 (s) = v2(s) almost everywhere on [0, t] for every v1(-), v2(-) € V, and t, t > 0, then u1(s) = u2(s) almost everywhere on [0, t], where u^(-) = u(z0, vj(■)), i = 1, 2; Sa and Sp are the balls with radii a and P, respectively, centered at the origin of the space Rn.

Definition 3. A strategy u = u(z0, v) is called a parallel pursuit strategy, or n-strategy if, for any v(-) € V, a solution z(t) of the Cauchy problem

z = u(z0, v(t)) — v(t), z(0) = 20, z(0) = 0 (2.7)

can be expressed as

z(t) = zoA(i,v(0), A(0,v(-)) = 1,

where A(t,v(-)) is a scalar function of t, t > 0. Usually, this function is called a convergence function in the pursuit problem.

Definition 4. In the pursuit problem, it is said that a n-strategy guarantees catching an evader on the time interval [0,tg] if, for every v(-) € V,

(a) there exists some time t*, t* € [0, tg] that generates the equality z(t*) = 0;

(b) the inclusion u(z0,v(-)) € U is satisfied on the time interval [0,t*]. Here, the number tg is called a guaranteed pursuit (or capture) time.

Definition 5. We call the function v* : R+ ^ Rn a strategy of the evader if v* (t) is a Lebesgue measurable function in t.

Now we will consider the game (U, V) from the standpoint of an evader.

Definition 6. In the evasion problem, it is said that a control v* (■) € V guarantees escaping if, for any u(-) € U, a solution z(t) of the Cauchy problem

z = u(t) - v*(t), z(0) = zo, z(0) = 0 (2.8)

is nonzero, that is, z(t) = 0 for all t > 0.

This paper is dedicated to solving the following problems when the controls of the players are subject to constraints (2.3) and (2.4), respectively.

Problem 1. Pursuit problem: Construct a n-strategy of the pursuer and find the guaranteed capture time in the game (U, V).

Problem 2. Evasion problem: Construct a strategy of the evader and evaluate how to vary distance between the players.

Problem 3. Solve the "Lifeline" game.

3. A solution of the pursuit problem

In a great number of mathematical problems with parameters, an interesting property of the final analytic results is their explicit dependence on these parameters, which are regarded as constants in the solution. However, these parameters can help to determine feasibility conditions for these problems. In this section, we are going to present necessary and sufficient feasibility conditions for the pursuit problem in the game (U, V).

If the pursuer and evader choose admissible controls u(-) € U and v(-) € V, respectively, then, depending on equation (2.7), we obtain the solution

t

z(t) = zo + J(t - s)(u(s) - v(s))ds. (3.1)

o

In the new notation introduced, the goal of the pursuer is now to fulfill the equality z(t*) = 0 at some time t*, t* > 0. As for evader's goal, it is to maintain the relation z(t) = 0 for all t > 0.

For the pursuer, it is not enough to achieve his goal only by program strategies, i.e., admissible controls depending only on time t. Therefore, similarly to [4], in the case under consideration, the strategy of the pursuer can also be determined depending only on the current states of the acceleration function v(t), t > 0, and given constants z0 and a.

For solving the pursuit problem, suppose that at the current time t, the pursuer is aware of the initial parameters x0, y0, x1, y1, and the constants a, P, the current time t, and the value of evader's control v(t).

Definition 7. Assume that a > p. Then, in the game (U, V), we call the function

u(z0, v) = v — A(z0, v)&, (3.2)

a parallel pursuit strategy (briefly, n-strategy) of the pursuer, where

X(zq,v) = {v,£o) + v/<v,^o)2 + a2-M2, Co = zq/\zq\, (3.3)

and (v,£0) is the scalar product of the vectors v and £0 in Rn. The function A(z0,v) is usually called a resolving function.

Now we will indicate some important features for the strategy (3.2) and the resolving function (3.3).

Lemma 1. The strategy (3.2) is defined and continuous for all v € Sp, and the equality |u(z0 ,v)| = a holds during the pursuit game.

Lemma 2. The resolving function (3.3) is defined, continuous, and non-negative for all v € Sp, and this function is bounded as follows:

a — P < A(z0, v) < a + p.

Definition 8. If a > P, then the scalar function

t

A(t, v(-)) = 1 - -i- I (t - s)\(z0,v(s))d,s (3.4)

|z0| J 0

is called a convergence function of the players in the game (U, V).

Lemma 3. Let a > P. Then

(a) for all v(-) € V, the function (3.4) is monotone decreasing with respect to t, t > 0;

(b) the function (3.4) is bounded for all t € [0, tg] as follows:

A1(t) < A(t,v(-)) < A2(t), (3.5)

where

t2 t2 Ai(t) = l~-^(a + f3), A2(i) = l--—(a-p).

Proof. (a) According to Lemma 2, it follows that

t

dA(t, v(-)) 1 / , . t .

-Hr21 = "N J X{zoMs))ds" "N(a- "» <

o

(b) Relying on the minimum lemma in the elementary optimal control problem [1, p. 360], we get the following estimation:

t

1 f t2

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

|zo| v(-)ev J 2|zo| M<£

o

On the other hand, by Lemma 2, we have

t

t2 1 f

Ai(i) = 1--:—r max X(zo,v) = 1 — ,—r max (t — s)X(zo,v(s))ds < A(t, v(-)).

2|zo| |zo| v(-)ev J

o

This completes the proof. □

Theorem 1. Let a > /3 in the game (U, V). Then the II-strategy (3.2) guarantees catching the evader on the time interval [0,tg], where tg = \j2\zo\/(a — ¡3).

Proof. Assume that the evader chooses some control v(-) € V while the pursuer implements the n-strategy (3.2). Then, by (3.1), we obtain the function

t

z(t) = zo + J(t - s)A(zo, v(s))£ods,

which can be written as follows:

z(t) = zoA(t,v(-)), (3.6)

where A(t, v(-)) is the players convergence function of the form (3.4). Taking into account the right-hand side of (3.5), we conclude that the function A2(t) is equal to zero at t = tg. Therefore, there exists some t* € [0,tg] such that A(t*, v(-)) = 0, and this (see (3.6)) results in z(t*) = 0. Theorem 1 is proved. □

4. A solution of the evasion problem

In this section, we will suggest an admissible strategy for the evader, which guarantees escaping in the evasion problem. Using this strategy, we will prove that the strategy (3.2) is an optimal pursuit strategy and tg is an optimal pursuit time.

Definition 9. We call the control function

v*(t) = -^o (4.1)

a strategy of the evader in the game (U, V).

Definition 10. It is said that the strategy v*(t) guarantees escaping on the time interval [0, tg) if for any control function of the pursuer u(-) € U, the relation z(t) =0 is valid for all t € [0, tg), where z(t) is the solution of the Cauchy problem (2.8).

Theorem 2. (a) Let a > (. Then the .strategy (4.1) guarantees escaping on the time interval [0, ) in the game (U, V), where tg is the guaranteed pursuit time (see Theorem 1).

(b) Let a < (. Then the strategy (4.1) guarantees escaping on the time interval [0, in the game (U, V), and the distance between the players is estimated as follows:

f |zo|, if a = 3,

\z(t)\>\ (g — f3)t2

I M--2-' ^ a 1

Proof. (a) Assume that a > ( and the pursuer picks a control u(-) € U while the evader applies the strategy (4.1). In accordance with (3.1), it follows the function

t t

z(t) = zo + J(t - s)u(s)ds + J(t - s)({ods. (4.2)

0 0

Estimate the absolute value of (4.2) as follows:

t t |z(t)| > zo + J(t - s)({ods - J(t - s)u(s)ds >

0 0 t

> M (l - ) - /(i - = - f (a - /?).

Relying on Theorem 1, we can write the estimation

for all t, 0 < t < tg.

(b) Suppose that a < (.In this case, a proof is similar to the proof of item (a), i. e.,

\z(t)\ > \zo\ - {a ~f)t2 >N>0 for all t € [0, Theorem 2 is proved. □

5. An attainability domain of the pursuer

In accordance with Theorem 1, if a > ( then, by the n-strategy (3.2), the evader is captured at some point in the space Rn. In the considered game, we will construct a set of meeting points of the players for the case a > (.

Let a triplet (y0, yi, v(-)), v(-) € V, generates a trajectory of an evader E in the form (2.6) while a triplet (x0,x1, u(z0,v(-)), u(z0,v(-)) € U, generates a trajectory of a pursuer P in the form

t

x(t) = x0 + x1t + J(t - s)u(z0, v(s))ds, (5.1)

0

where t € [0, t*], 0 < t* < tg, and t* is the encounter time of the players, that is, the equality x(t*) = y(t*) holds. Thus, for each pair (x(t),y(t)), we form the set

W(t) = W(x(t),y(t)) = {w : |w - x(t)| > (a/()|w - y(t)|}, (5.2)

on [0,t*]. Note that

W(0) = W(xo,yo) = {w : |w - xo| > (a/p)|w - yo|}. Since |z(t)| > 0 on [0, t*], it is obvious that the inclusion y(t) € W(t) is valid on this time interval.

Remark 1. Note that the trajectories x(t) and y(t) of the players and the multi-valued mapping W(t) directly depend on the choice of a control v(-) € V. This dependence is omitted for brevity.

Lemma 4. The multi-valued mapping W(t) can be expressed as

W(t) = x(t) + A(t, v(-))[W(0) - xo], (5.3)

where A(t, v(-)) is the convergence function of the form (3.4) and

W(0)=xo-c(zo) + R(zo)S, c(z0) = -^2, R(z0) = (5.4)

a2 - p2 a2 - p2

and S is the unit ball centered at the origin of the space Rn. Proof. First, write the set (5.2) as follows:

W(t) = x(t) + {w : |w| > (a/p)|w + z(t)|} = x(t) - c(z(t)) + R(z(t))S,

where

Now, by (3.6) and (5.4), the functions c(z(t)) and R(z(t)) can be written in the form

c(z(t)) = c(zo)A(t, v(-)), R(z(t)) = R(zo)A(t,v(-)).

Hence, we derive the validity of formula (5.3). □

Corollary 1. Lemma 4 implies that, for each t € [0, t*], the multi-valued mapping W(t) is a ball of radius R(zo)A(t, v(-)) centered at the point x(t) - c(zo)A(t, v(-)) and the set W(0) is a ball of the radius R(zo) centered at the point x(0) - c(zo).

Lemma 5 (The main lemma). The multi-valued mapping W(t) - txi is monotone decreasing in t € [0,t*] with respect to embedding, i.e., if t1 ,t2 € [0, t*] and t1 < t2, then

W(t2) - t2xi C W(ti) - tixi.

P r o o f. From the statement of the problem, we have geometric constraint of the form (2.4) on values of the evader's acceleration vector. As a consequence, we find that

Ki)|2<^T^(«2-Ki)|2). (5.5)

From the form of the resolving function (3.3), it is easy to ensure in the validity of the equality

a2 - |v(t)|2 = A(zo, v(t)) (A(zo, v(t)) - 2<v(t),ft» •

Due to this, inequality (5.5) takes the form

23 2 (2

Completing the square in this inequality, we obtain the following inequality:

a2 - (2

However, for all ^ € Rn, = 1, the relation

32 a(

Ht)+J a9X(z0,v(m0 < a2 _ 2X(zo,v(t))- (5-6)

/ 32 \ 32

V'^ + ^r^^^O, + oMWo

a2 - (2

holds. Then, using the inequality (5.6), we find that

/ 32 \ a(

V'^ + V) < ¿rf-pXizoMt)).

Integrate both sides of this inequality over the interval [0,t]:

t t / 32 \ a( f

^v(s) + —^—pHzo,v(s))£,0, ipjds < a2 _ J X(z0,v(s))d,s. (5.7)

00

For the left-hand side of (5.7), we can write the following equalities:

(v(s) +^T^^o, ip^d,s = J (v(s) +-l)\(zo,v(s))£o, ip^ds =

0

t 2 t

= J (v{s) - \{zo,v{s))£o, ip^ds + J X{z0,v{s))ds.

By the definition of n-strategy (3.2) and in view of the form of the vector c(z0) in (5.4), the latter equality takes the form

t 2 t

j (i>{s) + 2\(zo,v(s))£o, ipjds = J u(z0,v(s))ds, +

0 t 0 (5.8)

/ A(*„ 0

Taking into account the form of R(z0) in (5.4), for the right-hand side of inequality (5.7), we obtain

t t IX(z0,v(s))ds = 1X(z0,v(s))ds. (5.9)

00

Thus, by (5.7), (5.8), and (5.9), we have the relation

t

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

( J u(zo,v(s))ds, v) + ^ R{Zo) J X(zo,v(s))ds < 0. (5.10)

00

Applying the properties of the support function [7]

F(W,0) = sup (w,0) wew

in — e Rn, |0| = 1, and using formulas (3.4), (5.1), (5.3), and (5.4) we can find the derivative of W(t) in t as follows:

t

jtF(W(t),xP) = jtF(x,0+xlt + j(t-s)u(zo,v(s))ds + A(t,v(-))[R(zo)S-c(zo)}, ip) =

t

= (xi, V) + ( f u(zo,v(s))ds, v) + (c(Zo)' y R{Zo) I X(z0,v(s))ds. 0 0

d

From this and inequality (5.10), we get —F(W(t) — txi,ip) < 0 for all ip € M'\ \ip\ = 1. This completes the proof of Lemma 5. □

Corollary 2. Lemma 5 implies the inclusion W(t) C W(0) + txi for all t e [0, t*].

By Lemma 5, we obtain an attainability domain of the evader.

Lemma 6. The inclusion

y(t) e W(0)+ txi (5.11)

holds for all t e [0,t*].

Proof. The inclusion (5.11) easily follows from the form of the multi-valued mapping (5.2) and Corollary 2. □

Corollary 3. Lemma 6 implies that, if the initial velocities of the players are equal to zero, i.e., x1 = y1 = 0, then the boundary of the attainability set of the evader is the Apollonius sphere of the form (5.4) in Rn.

To solve the "Lifeline" game in favor of the pursuer, using (5.11), one can define the set

tg

Wp = U {W (0) + txi},

t=0

which is obviously convex and closed. Here

tg = y/2\zo\/(a-P) (see Theorem 1). We call the set WP the attainability domain of the pursuer.

t

6. A solution of the "Lifeline" game

In this section, the "Lifeline" game will be considered only in the case a > ^ and x1 = y1. Hereinafter, we will omit these conditions in statements for brevity.

Definition 11. We say that the n-strategy (3.2) provides winning for the pursuer in the "Lifeline" game on the time interval [0, tg] if there exists a time t* e [0, tg] such that the following conditions are valid:

(а) x(t*) = y(t*);

(б) y(s) € M for all s € [0,t*].

Theorem 3. If the attainability domain WP of the pursuer does not intersect the set M, i.e., WP n M = 0 in the "Lifeline" game, then the n-strategy (3.2) provides winning for the pursuer P.

Proof follows immediately from Theorem 1, Lemma 5, and Lemma 6. □

Now let us consider a solution of the "Lifeline" game in favor of the evader E.

Definition 12. It is said that a control v*(-) € V provides winning for the evader in the "Lifeline" game if, for any control of the pursuer u(-) € U, at least one of the following conditions holds:

а) there exists a finite time t satisfying the inclusion y(T) € M and the relation x(t) = y(t) for all t € [0,t);

б) x(t) = y(t) for all t > 0.

Consider the set

WE = {w* : w* = y/2\u - yo\/fixi + u, w € TL(0)}. We call the set WE the attainability domain of the evader.

Theorem 4. If the attainability domain WE of the evader intersects the set M, i.e., WP n M = 0 in the "Lifeline" game, then there exists a control v*(■) € V that provides winning for the evader E.

Proof. Depending on the theorem conditions, there exists a point w* € WE n M such that the following equality holds:

(Jj* = y/2\u -yo\/pxi + u, U) £ W{0). Then we prescribe the constant control of the form

v* = - yo)/|w - yo| (6.1)

for the evader E.

First of all, let us show that, by means of the control (6.1), the evader E reaches the chosen point w* at the time r = y/2\w — yo\/¡3. To this end, substituting (6.1) into (2.6), we form

T

f T2

y{r) = yo + yiT + (t - s)v*ds = yo+ yiT + —v*. (6.2)

o

From (6.1) and (6.2) and taking into account the equality xi = yi, we obtain

y(r) = y0 +xiy/2\u -yo\/f3 + uj -y0 = uj*.

Now let us prove that condition (a) of Definition 12 holds, that is, the evader remains uncaught. Suppose the opposite, i.e., that there is some control u*(-) € U of the pursuer, the implementation

of which satisfies the equality x(t) = y(t) at time t less than t, i.e., t < t. Then, by equation (3.1), we can write

t

z(i) = zo + J(i - s)(u*(s) - v*)ds = 0.

0

Thus, for the right-hand side of z(t), we can write the absolute estimations:

t t

i2

z0 — J (i — s)v*ds 0

i' t2

< /(i-s)|M*(s)|ds<a-

or

i2

i2 *

z0--V

2

i2

< (6-3)

Introducing the notation n = ¿2/2 in (6.3) and taking into account that |v*| = (, we obtain the quadratic inequality in terms of n in the form

(a2 - (2)n2 + 2n(z0,v*)-|z012 > 0.

It follows that

t 1 / \ V = 2 - (V+ (a2 -/?2)|z0|2 - (z0,v*)J . (6.4)

T2 P

y = |w-yo|//i > -2"

72

2 -~a2-f32

By the assumption

and (6.4), we determine the relation

> 0Trj2 ^{zo,v*)2 + {a2-^)U\2-{zo,v*)) • (6.5)

According to the control of the evader (6.1) and by inequality (6.5), we get

a

-\w -y0\ > \w XoI,

i.e., the inclusion w € W(0) (see (5.2)) does not hold, which contradicts our assumption. Theorem 4 is proved. □

Remark 2. Using the definitions of the attainability domain WP of the pursuer and the attainability domain WE of the evader, it is not difficult to ensure that the inclusion WE C WP is valid in the "Lifeline" game.

7. Examples

Example 1. (Problem for the case with a lifeline). Let the game (2.1)-(2.4) be given as follows:

x = u, x0 = (0,0), Xi = (0,1), |u(i)| < y/2, t> 0, (7.1)

y = v, yo = (0,-1), yi = (0,1), |v(t)|< 1, t > 0. (7.2)

Then, according to Theorem 1, we have tg = V2V2 + 2. By Lemma , we can write the set

W{0) = {(jJ : \uj - c\ < R, c = (0, -2), R = y/2}.

Figure 1. The case where the pursuer P wins in the "Lifeline" game (7.1)-(7.2).

The boundary of W(0) is

dW(0) = {w = (w 1, w2) : w2 + (w2 + 2)2 = 2}, (7.3) and the following equality is valid for these points:

\u-yo\ = + 2 +I)2. (7.4) By (7.3) and (7.4), we obtain the set

WE = jo;* = (¿>i,o>2) : ¿2 = y = ± - 2 J.

Fig. 1 and 2 show the shapes of the sets WP and WE in the "Lifeline" game (7.1)-(7.2).

Example 2. (Attainability domain in the case of many pursuers and one evader). Consider the following game example:

Xi = Ui, Xi(0) = Xio, Xi(0) = n, |u(t)| < Pi, t > 0, (7.5)

y = v, y(0)= yo, 2/(0) = n, |v(t)|< 1, t > 0, (7.6)

where ¡i{ > 1 and £¿0 2/0? i = 1? m-By Lemma 6, we have

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

m

y(t) € f| Wio + tn,

i=i

where

2

p2 Pi

Wio = X^o — Qo + Qo = —o-2—T^OJ = —O-rkiol? = — 2/0-

P2 -1 P2 -1

Figure 2. The case where the evader E wins in the "Lifeline" game (7.1)-(7.2).

The attainability domain of the pursuers in the game (7.5)-(7.6) has the form

T* m

Wp = U [ HWiQ + tn ,

where

t=0 i= 1

T * = min

2|zi01

i=l,m y l^i 1

8. Conclusion

In this paper, we have considered the pursuit-evasion problems and the "Lifeline" game of one pursuer and one evader for the inertial movements when the initial velocity vectors of the players are the same. We have imposed geometric constraints on the controls of the players. The n-strategy was suggested for the pursuer and given optimal pursuit time in the pursuit problem. We have proposed a specific strategy for the evader. By this strategy, it was proved that the n-strategy is optimal and the evasion is possible from the given initial states. As the main result, we have obtained the main lemma (Lemma 5) and applied this lemma to solve the "Lifeline" game.

A "Lifeline" game of many players, when geometric constraints are imposed on controls of players, can be studied in further research.

REFERENCES

1. Alekseev V.M., Tikhomirov V.M., Fomin S.V. Optimal'noye upravleniye [Optimal Control]. Moscow: Nauka, 1979. 432 p. (in Russian)

2. Aubin J.-P., Cellina A. Differential Inclusions. Set-Valued Maps and Viability Theory. Berlin-Heidelberg-New York-Tokyo: Springer-Verlag, 1984. 342 p. DOI: 10.1007/978-3-642-69512-4

3. Azamov A. On the quality problem for simple pursuit games with constraint. Serdica Bulgariacae Math. Publ., 1986. Vol. 12, No. 1, P. 38-43. (in Russian)

4. Azamov A.A., Samatov B.T. The n-strategy: analogies and applications. In: Proc. The Fourth Int. Conf. Game Theory and Management (GTM 2010), June 28-30, 2010, St. Petersburg, Russia, vol. 4. 2010. P. 33-47.

5. Berkovitz L. D. Differential game of generalized pursuit and evasion. SIAM J. Contr., 1986. Vol. 24, No. 3. P. 361-373. DOI: 10.1137/0324021

6. Blagodatskikh A.I., Petrov N.N. Konfliktnoe vzaimodejstvie grupp upravlyaemyh ob"ektov [Conflict Interaction Between Groups of Controlled Objects]. Izhevsk: Udmurt State University, 2009. 266 p. (in Russian)

7. Blagodatskikh V.I. Vvedenie v optimal'noye upravleniye [Introduction to Optimal Control]. Moscow: Vysshaya Shkola, 2001. 239 p. (in Russian)

8. Chikrii A. A. Conflict-Controlled Processes. Dordrecht: Kluwer Academic Publishers, 1997. 404 p. DOI: 10.1007/978-94-017-1135-7

9. Dar'in A. N., Kurzhanskii A. B. Control under indeterminacy and double constraints. Differ. Equ., 2003. Vol. 39, No. 11. P. 1554-1567. DOI: 10.1023/B:DIEQ.0000019347.24930.a3

10. Elliot R. J., Kalton N. J. The existence of value in differential games of pursuit and evasion. J. Differential Equations, 1972. Vol. 12, No. 3. P. 504-523. DOI: 10.1016/0022-0396(72)90022-8

11. Fleming W.H. The convergence problem for differential games, II. In: Advances in Game Theory, 2nd ed. Annals of Math., vol. 52. 1964. P. 195-210. DOI: 10.1515/9781400882014-013

12. Friedman A. Differential Games, 2nd ed. Pure Appl. Math., vol. 25. New York: Wiley Int., 1971. 350 p.

13. Grigorenko N.L. Matematicheskie metody upravleniya neskol'kimi dinamicheskimi processami [Mathematical Methods of Control for Several Dynamic Processes]. Moscow: Izdat. Gos. Univ., 1990. 198 p. (in Russian)

14. Hajek O. Pursuit Games: An Introduction to the Theory and Applications of Differential Games of Pursuit and Evasion. Mineola, NY: Dove. Pub. Inc., 2008. 266 p.

15. Ho Y., Bryson A., Baron S. Differential games and optimal pursuit-evasion strategies. IEEE Trans. Autom. Control, 1965. Vol. 10, No. 4. P. 385-389. DOI: 10.1109/TAC.1965.1098197

16. Ibragimov G. I. A game of optimal pursuit of one object by several. J. Appl. Math. Mech., 1998. Vol. 62, No. 2, P. 187-192. DOI: 10.1016/S0021-8928(98)00024-0

17. Ibragimov G.I. Optimal pursuit with countably many pursuers and one evader. Differ. Equ., 2005. Vol. 41, No. 5. P. 627-635. DOI: 10.1007/s10625-005-0198-y

18. Ibragimov G.I. The optimal pursuit problem reduced to an infinite system of differential equations. J. Appl. Math. Mech., 2013. Vol. 77, No. 5, P. 470-476. DOI: 10.1016/j.jappmathmech.2013.12.002

19. Ibragimov G.I. Optimal pursuit time for a differential game in the Hilbert space l2. Sci. Asia, 2013. Vol. 39S. P. 25-30. DOI: 10.2306/scienceasia1513-1874.2013.39S.025

20. Isaacs R. Differential Games. NY: John Wiley and Sons, 1965. 385 p.

21. Ivanov R. P., Ledyayev Yu. S. Time optimality for the pursuit of several objects with simple motion in a differential game. Proc. Steklov Inst. Math., 1983. Vol. 158, P. 93-103.

22. Kornev D.V., Lukoyanov N.Yu. On a minimax control problem for a positional functional under geometric and integral constraints on control actions. Proc. Steklov Inst. Math., 2016. Vol. 293, No. Supl. 1. P. S85-S100. DOI: 10.1134/S0081543816050096

23. Krasovskii N.N., Subbotin A.I. Game-Theoretical Control Problems. NY: Springer, 2011. 517 p.

24. Munts N. V., Kumkov S. S. Numerical method for solving time-optimal differential games with lifeline. Mat. Teor. Igr. Pnl, 2018. Vol. 10, No. 3. P. 48-75. (in Russian)

25. Munts N. V., Kumkov S. S. On the coincidence of the minimax solution and the value function in a time-optimal game with a lifeline. Proc. Steklov Inst. Math., 2019. Vol. 305, No. Supl. 1. P. S125-S139. DOI: 10.1134/S0081543819040138

26. Pang J.-S., Stewart D.E. Differential variational inequalities. Math. Program., 2008. Vol. 113, No. 2, Ser. A. P. 345-424. DOI: 10.1007/s10107-006-0052-x

27. Pashkov A. G., Terekhov S. D. A differential game of approach with two pursuers and one evader. J. Optim. Theory Appl., 1987. Vol. 55, No. 2. P. 303-311. DOI: 10.1007/BF00939087

28. Petrosjan L. A. Differential Games of Pursuit. Ser. Optim, vol. 2. Singapore: World Scientific Publ., 1993. 340 p. DOI: 10.1142/1670

29. Pontryagin L. S. Izbrannye trudy [Selected Works]. Moscow: MAKS Press, 2004. 551 p. (in Russian)

30. Pshenichnyi B.N. Simple pursuit by several objects. Cybernet. Systems Anal., 1976. Vol. 12, No. 5. P. 484-485.

31. Pshenichnyi B. N., Chikrii A. A., Rappoport I. S. An efficient method for solving differential games with many pursuers. Dokl. Akad. Nauk SSSR, 1981. Vol. 256, No. 3. P. 530-535. (in Russian)

32. Samatov B. T. On a pursuit-evasion problem under a linear change of the pursuer resource. Siberian Adv. Math., 2013. Vol. 23, No. 10. P. 294-302. DOI: 10.3103/S1055134413040056

33. Samatov B. T. The pursuit-evasion problem under integral-geometric constraints on pursuer controls. Autom. Remote Control, 2013. Vol. 74, No. 7. P. 1072-1081. DOI: 10.1134/S0005117913070023

34. Samatov B. T. The n-strategy in a differential game with linear control constraints. J. Appl. Math. Mech, 2014. Vol. 78, No. 3. P. 258-263. DOI: 10.1016/j.jappmathmech.2014.09.008

35. Samatov B. T. Problems of group pursuit with integral constraints on controls of the players. I. Cybernet. Systems Anal., 2013. Vol. 49, No. 5. P. 756-767. DOI: 10.1007/s10559-013-9563-7

36. Samatov B. T. Problems of group pursuit with integral constraints on controls of the players. II. Cybernet. Systems Anal., 2013. Vol. 49, No. 6. 907-921. DOI: 10.1007/s10559-013-9581-5

37. Samatov B. T., Ibragimov G., Khodjibayeva I. V. Pursuit-evasion differential games with Gronwall-type constraints on controls. Ural Math. J., 2020. Vol. 6, No. 2. P. 95-107. DOI: 10.15826/umj.2020.2.010

38. Satimov N. Yu., Rikhsiev B.B., Khamdamov A. A. On a pursuit problem for n-person linear differential and discrete games with integral constraints. Math. USSR Sb., 1983. Vol. 46, No. 4. P. 459-471. DOI: 10.1070/SM1983v046n04ABEH002946

39. Shiyuan J., Zhihua Q. Pursuit-evasion games with multi-pursuer vs. One fast evader. In: Proc. 8th World Congress on Intelligent Control and Automation, July 7-9, 2010, Jinan, China. IEEE Xplore, 2010. P. 3184-3189. DOI: 10.1109/WCICA.2010.5553770

40. Subbotin A. I. Generalization of the main equation of differential game theory. J. Optim. Theory Appl., 1984. Vol. 43, No. 1. P. 103-133. DOI: 10.1007/BF00934749

41. Subbotin A. I., Chentsov A. G. Optimizaciya garantii v zadachah upravleniya [Optimization of Guaranteed Result in Control Problems]. M: Nauka, 1981. 288 p. (in Russian)

42. Ushakov V. N. Extremal strategies in differential games with integral constraints. J. Appl. Math. Mech., 1972. Vol. 36, No. 1. P. 12-19. DOI: 10.1016/0021-8928(72)90076-7

43. Ushakov V. N., Ershov A. A., Ushakov A. V., Kuvshinov O. A. Control system depending on a parameter. Ural Math. J., 2021. Vol. 7, No. 1. P. 120-159. DOI: 10.15826/umj.2021.1.011

i Надоели баннеры? Вы всегда можете отключить рекламу.