Научная статья на тему 'A DIffERENTIAL GAME OF POLLUTION CONTROL WITH PARTICIPATION OF DEVELOPED AND DEVELOPING COUNTRIES'

A DIffERENTIAL GAME OF POLLUTION CONTROL WITH PARTICIPATION OF DEVELOPED AND DEVELOPING COUNTRIES Текст научной статьи по специальности «Математика»

CC BY
7
6
i Надоели баннеры? Вы всегда можете отключить рекламу.
Область наук
Ключевые слова
POLLUTION CONTROL / DIFFERENTIAL GAMES / COOPERATIVE GAMES / RANDOM TIME HORIZON

Аннотация научной статьи по математике, автор научной работы — Gromova Ekaterina, Plekhanova Kristina

Inthispaper, a 2-player non-cooperative differential game is considered. We assume that the first player is a developed country with linear-quadratic utility function while the second player is a developing country whose utility function explicitely depends on time. Furthermore, the duration of the game is considered to be a random variable which reflects the instability of the economical development of developing countries. In the paper the duration of the game is assumed to be exponentially distributed. The optimal strategies are sought in the class of open-loop strategies.

i Надоели баннеры? Вы всегда можете отключить рекламу.
iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.
i Надоели баннеры? Вы всегда можете отключить рекламу.

Текст научной работы на тему «A DIffERENTIAL GAME OF POLLUTION CONTROL WITH PARTICIPATION OF DEVELOPED AND DEVELOPING COUNTRIES»

Contributions to Game Theory and Management, VIII, 64—83

A Differential Game of Pollution Control with Participation of Developed and Developing Countries*

Ekaterina Gromova and Kristina Plekhanova

St. Petersburg State University, Faculty of Applied Mathematics and Control Processes E-mail: ekaterina. shevkoplyas@gmail. com

Abstract In this paper, a 2-player non-cooperative differential game is considered. We assume that he first player is a developed country with linear-quadratic utility function while the second player is a developing country whose utility function explicitely depends on time. Furthermore, the duration of the game is considered to be a random variable which reflects the instability of the economical development of developing countries. In the paper the duration of the game is assumed to be exponentially distributed. The optimal strategies are sought in the class of open-loop strategies. Keywords: pollution control, differential games, cooperative games, random time horizon

1. Introduction

This contribution is devoted to the important problem of the control of environmental pollution by a group of participating countries. This problem is formulated within a game-theoretic framework; the controls are the rates of pollution, the scalar state variable is the level of pollution.

In this paper, we considered the 2-player non-cooperative differential game proposed by Massoudi and Zaccour in (Masoudi and Zaccour, 2013). In this setup, the first player is a developed country with linear-quadratic utility function while the second player is a developing country whose utility function explicitely depends on time. In this work the described model was modified, namely we changed the payoff function of the second player. Furthermore, the duration of the game is considered to be a random variable which reflects the instability of the economical development of developing countries. We assumed that the duration of the game is exponentially distributed. The optimal strategies were sought in the class of open-loop strategies.

The paper is structured as follows: in Section 2 both cooperative and non-cooperative formulations were considered. We found a Nash equilibrium solution and a Pareto optimal solution (for the case of equal weights). Moreover, we computed a cooperative solution (Shapley value) on the base of three different characteristic functions: classical maxmin approach by Neumann-Morgenstern (Von Neumann and Morgenstern, 1944), max-approach with the use of Nash equilibrium solution for left out players (Petrosjan and Zaccour, 2003), and min-approach with the use of Pareto optimal solution for the players within the coalition (Petrosyan and Gromova, 2014). Finally, an analytical formulation for the components of the imputation distribution procedure guaranteeing time consistency (Petrosyan, 1977) of the chosed solution (Shapley value) were obtained. In Section 3 we consider the 3-player game in which the first player is the developed country and the remaining two players are the developing countries. All results are illustrated by simulation results.

* This work was supported by the grants 9.38.245.2014 from St. Petersburg State University.

2. 2-person game model and main assumptions

As the basic model the 2-person differential game of pollution control proposed in (Masoudi and Zaccour, 2013) is considered. In the game r(So), let the player 1 be a developed country, which is characterized by high level of industry development, and the player 2 be a developing country with evolving costs. The game evolves in continuous time on the time interval t G [0, to). Each country i controls the volume of pollution emission Mj, i = 1, 2, with upper critical value aj (w G [0, «¿j).

Let S(t) be the accumulated volume (stock) of pollution which evolves according to the following differential equation:

S(t)= (t) + M2(t)) - SS(t), S(0)= So, (1)

where ^ is a positive parameter, S is the absorption coefficient, S0 is the initial volume of the pollution.

The utility function for player i is described as a discounted difference between revenue /¿(uj) and damage cost Dj (S), which corresponds to the expenses resulted from the environmental pollution. Suppose that functions Dj (S) are continuously differentiable and convex.

Let the revenue functions /¿(wj) for both players have a similar form:

/¿(m¿) = aMj - 2u2, i = 1,2.

Furthermore, let the damage costs for the player 1 (developed country) have the following form: Di(S) = ^S, i.e. the damage cost for developed country is proportional to the volume of pollution. Let p be the constant discount factor. Then the payoff function for player 1 is as follows:

Ji(u1,u2,S0) = J e-pi ^wi - 2ui - AS^ dt, (2)

o

subject to (1).

We assume that T has the exponential distribution with parameter A:

/(t) = Ae-A(i-io), F(t) = 1 - e-i(t-to), /L = = A(t) = A.

Damage costs function in the model (Masoudi and Zaccour, 2013) is such that the damage cost increases with time:

D2(S (t),t)= ^2 ^S(t). (3)

In our model we have T as a random value and hence instead of T in (3) we put its mathematical expectation E(T) = i. Then we get:

D2(S (t),t)= &AtS (t). (4)

In this way the following form for expected integral payoff of player 2 is obtained:

J2(ui, M2, So) = E e-pT ^«2M2 — 2w2 — ^AtS^ drdt^ =

œ t

e-pT ^a2w2 - 1 «2 - ß2ArS^ drdF(t).

0 0

After simplification of the integral payoff (Kostyunin and Shevkoplyas, 2011), we get

œ

J2(ui,«2, So) = J e-pT - 1 «I - AArS)(1 - F(tdr =

0

e-(p+A)T ( a2u2 - 1 «2 - ß2ArS ) dr. (5)

J e-(p+A)T ^a2«2 - 1 «2 - ß2 ArS^ dr.

0

It is easily seen that the integral payoff of player 2 is equivalent to the integral payoff of the respective player in the game with infinite time horizon and with discounting factor p + A.

2.1. A cooperative 2-person game

Let us now consider the cooperative game rc(So) on the base of the game r(So). It means that all players (developed country and developing country) act together to maximize their joint payoff

max(Ji(ui,U2,So) + J2(«i, u2, So)) =

U1 ,U2

oo , __V OO , __X

f e-pt (a1UT - 2u2 - AS) dt + / e-(p+A)r (a2«2 - 1 «2 - AAtS) dr, o ^ ' o ^ '

subject to S(t) = u (u1(i) + u2(t)) - ¿S(t), S(0) = So,

We will call «T, «2 optimal controls for players 1, 2, S(t) - optimal (cooperative) trajectory. To find the optimal controls in the open-loop form, we apply Pontrygin's maximum principle (see Appendix 1). Then we get:

- m f O. AAe-At feAte-At^

Mi(t) = ai - u ---1--2 + ---- , (6)

U + p (5 + A + p)2 5 + A + pi'

- /At ß1 , ß2A , ß2 At «2(t) = «2 - M e ---1--2 + ----

1 5 + p (5 + A + p)2 5 + a + p

The controls of players must be taken from the compact sets 0 < u < Oj, i = 1, 2. Under the additional condition

«1 > ^77 , (7)

0 + p

the control u1(t) is a nonnegative function and its upper value is equal to a1. The economical sense of this condition could be investigated by economists.

For the player 2 the inequality U2 < a2 is satisfied. But it may happen that at some time instant t* the function u2(t) becomes negative. Direct calculations give t*:

Y eY 1

t*

A (5 + A + p) '

where

(J + A + p)(^P - + p)) --+A+-Y = -e d+A+p.

(J +

If t > t* then the control for player 2 leaves the compact set [0, a2]. Hence after time instant t* the optimal control takes its value on the boundary of the set of admissible control values. Finally the optimal control for player 2 has the following form:

' ( xt ft feA feAt ^

. «2 - P e 7--1--9 + 7-:- , t > t ,

U2 = { J + p (J + A + p)2 J + A + p)' ' (8)

0, t < t*.

Then we get the joint maximal payoff for all players:

t*

5q, rq) — ■ " ' """" ~2

0

œ t

+ J e-pT (,„u _ i - + / ^ - I „2

t* 0

J(ui, M2, So, Ro) = J e pT ^«iui — ^ui2 — ftS^ dr+

t*

- ftR^ dr + J e-(p+A)T ^2M2 - 1M22 - ftArS^ dr+

+ J -e-(p+A)TftArRdr, (9)

t*

where R(t) is the cooperative trajectory after t*, i.e. the solution of the differential equation

R(t) = pu - SR, R(t*) = S(t*). (10)

We find the optimal (cooperative) trajectory before time instant t*

S(t) = Soe-5t + p (ai + «2) ^i(0) - P2^ (^i(0) + ^i(A)) -

S + p

(^i(0) + ^i(-A)) - s+t+- te(0) + ^(-A)), (S + A + p)2 S + A + p

and after time instant t*

p2ftA , p2 ^2 At

r:r~ ^(0) - 77—;—3^i(-A) - r , x , ^ S + pj (S + A + p)2 S + A + p

R(t) = Roe-Ät + p ( ai - ffp ) (0) - ^i(-A) - (-A),

where

. . ext - e-Ät . . ext(J + x)t - ext - e-Ät

= ' ^2(x) =-(j+x^-•

Let us denote

ßi ß2A

a = ai + «2----0,

J + P (J + A + p)2

ßi ß2A ßi , ß2A

b = —, c = - ~-^-2 ' d = ai - ~—, f = -"

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

J + P (J + A + p)2 J + P J + A + p

Then we get an analytic formula for the joint maximal payoff:

t*

J(ui,W2, So, Ro) = J e-pT ^aiui — 1 U — /iS^ dT+

o

w t*

+ J e-pT ^aiu1 — 2ul — R^ dT + J e-(p+A)T ^a2u2 — 1 u2 — /2 AtS^ dT+

t* o

w

(p2 (d + c) + ((c + 2 d) A + f) p + dA2) ai p (P + A)2

+ / — e-(p+A)T/2ATRdT = ^ v" ' ^ ' ' +

t*

— (p + 2 A)3 d2 — p (4 c2A2 + (4 pc2 +4 fc) A + c2p2 +2 pcf + 2 f2) + 2p (p + 2 A)3 +

— 2 (p + 2 A)3 (pc + f + Ac) d a/^(5ePt* — p — 5 + e-5t* A e-Pt*

+ „ , . x 2 +

2 (p + A)2 5p (p + 5)

b ((—p — 5) e-t* (p-a) + (p — A) e-(P+5)t* + 5 + a) / ^

/ ^ c

+ T—TTVT—TTT +

(5 + A) (p — A) (p + 5) (p + A) (p + 5)

(V»*p2 — 52ept* + ((5t* — 1) p + 5) (p + 5)) f/e-pt*M + 52p2 (p + 5) + (p + 5) (p + A)2 +

S So (—1 + e-(p+5)t*) e-(p+i)t* S Ro /^d (—p — 5 + e-5t* p) e-pt* + 2 p+5 + p+5 5p (p + 5) +

+ a2 e-p t* ((a2 + c + t* f) p + (a2 + c + t* f) A + f) e-A t* +

+ (p + a)2 +

a2 ((a2 + c + b) p2 + ((2 b + c + a2) A + f) p + bA2) a2 e-pt*b + "—---:—

p (p + a)2

^ (— (p2be^ A + 2 (p — A) ((a2 + c + ft*) p + f )) b (p + A)3) —

2 (p — A) p2 (p + A)

(c + a2 + b)2 p5 + 3 ((b + 1/3 a2 + 1/3 c) A + 2/3 f) (c + a2 + b) p4

2 (p — A) p2 (p + A)3 — ((3 b2 — (a2 + c)2) A2 + 4 A bf + 2 f2) p3 2 (p — A) p2 (p + A)3 ((b2 + (—4 a2 — 4 c) b — (a2 + c)2) A2 — 2 f (a2 + c) A — 2 f2) Ap2

2 (p — A) p2 (p + A)3 — 2 b (A (a2 + c) + 2 f) A3p — 2 bfA4 — 2 (p — A) p2 (p + A)3 +

/2 A^a (— (pt* + At* + 5t* + 1)e-(p+A+5)t*) /2 A^a ((pt* + At* + 1) e-(p+A)t*)

+---. , . . „2-- + "

5 (p + A + 5)2 5 (p + A)2

/2 A^a (—2 (1/25 + A + p)) (p + A)2 (p + A + 5)2

+ ■

ß2 AM6e-pt* (p2 (pt* + At* + St* + i)e-t* (S+A) + 2 (S + A) (i/2 S + p + i/2 A)ept*j

(S + A) p2 (p + A + S)2 +

+ ß2 Am (pt* + i) + (S + 2 p + 3 A) (ß) AMc + + (S + A) p2 + (p + 2 A)2 (p + A + S)2 +

Am ((St* + i + (p + A) t*) (p + A)3 e-(p+A+S)t* + ((2 + (p + A)2 t*2 + (2 p + 2 A) t*) S)) ß2 f

S2 (p + A)3 (p + A + S)2 Am (V (i + (p + A) t* )(p + A))(p + A + S)2 e-(p+A)t* - 3 S2 (p + A + 2/3 S)) ß2 f

S2 (p + A)3 (p + A + S)2 (2 S + 3 p + 4A) ßi A m f , Ro (i + (p + S) t*) A (ß) e

-(p+S) t*

(p + 2 A)3 (p + A + S)2 + (p + S)2

\2/„ + * I \ +* I c j. * I i\ „ - (p+A + S)t* I / I -y I r\2/ . *

dA (- (p + A)2 (pt* + At* + St* + i)e-(p+A+S)t* + (p + A + S)2 (pt* + At* + i)e-(p+A)t*j ß2

S (p + A)2 (p + A + S)2 '

2.2. Nash equilibrium for 2-player game

Now we find a pair of strategies (uiNE ,u2NE ) which form the Nash equilibrium solution. The optimization problems for players 1, 2 are formulated as follows

œ

max Ji(ui,uNE, So) = J e-p^ «iuiNE - 2 (ufE )2 - ßiSdt,

0

w

max Ji(uNE,M2,So) = J e-(P+A)^«2M2NE - 2 (u^)2 - ftAtSNE^ dt,

o

S(t) = p (uNE(t) + wNE(t)) - SS(t), S(0) = So. Using Pontryagin's maximum principle we get

ui = ai - p——, (11)

S + p

NE f fo A ftAt \

M2 = «2 - P -2 + 7--- .

V(S + A + p)2 S + A + pj

From condition (7) it follows that control uiNE for player 1 will not leave the admissible set [0, ai]. It is also clear that control w2NE will be bounded from above by a2. But as for cooperative case, there is a time instant t** at which w2NE(t) reaches zero (and then leave the compact set). In the same way as before we find

= a2(S + A + p) _ 1 (12) pftA (S + A + p), ( )

and

- ß2A + ß2 At \

u^ ={ «2 ^(J + A + p)2 + J + A + p)' ' (13)

0, t < t**.

We also rewrite the payoff of the player 2:

J2(ui,U2,So,Ro) = J e-(p+A)T ^«2«2 - 1 «2 - &ArS^ dr+

0

œ

^y -e-(p+A)Tß2ArRdr,

where R(t) is the trajectory after time instant t*,

R(t) = p« - 5R, R(t**) = S(t**).

Finally we get the Nash trajectory before t** in the following form:

am Q st, (1 - e (x( , Mß2A i

S (t) = Soe H--- 5 ai - —--h «2 ,—+

52 V V 1 5 + p 2 (5 + A + p)2,

p^A \ pß2At

+

(5 + A + p); 5(5 + A + p):

and after time instant t** we get

R(() = Roe- + »-£>(,(«1 -

5 V V 5 + p

The payoff of the player 2 is not shown here because of the bulkiness.

2.3. Analysis of results

In the preceding section we presented results for cooperative and non-cooperative 2-person games in analytic form. Now we give a numerical illustration of these results for the following parameters of the model:

a2 = 20; a1 = 30; ft = 0.03; ft := 0.06; A = 0.5; p = 0.1; S = 0.5; p = 0.8.

Fig. 1: Volume of emissions for developed country

As shown in Fig. 1, in cooperative case the volume of emissions for the developed country is lower than in the non-cooperative case.

Figure 2 demonstrates that in cooperative case the volume of emissions for developing country is lower than in the non-cooperative case. Finally, Figure 3 shows that in the cooperative case of the game the accumulated volume of the pollutions is lower than in the non-cooperative case of the game.

Thus the central conclusion that the cooperative behavior of the players (countries) is beneficial both for all participants of the game and also for the environment.

Fig. 2: Volume of emissions for developing country

Fig. 3: Stock of the pollution

2.4. Imputations in cooperative game

In cooperative game players have to share the total maximal payoff according to some optimality principle. In this paper we use the Shapley Value as a rule for the allocation of this amount:

Shi(So, 0) = V (n - k)!(k - 1)! [V (K; So, 0) - V (K\ {i} ; So, 0)], i G N, z—' n!

K3i

where k is number of players in coalition K, V(K, So, to), K C N is the characteristic function which shows the power of the coalition K in cooperative game rc (So), constructed by any appropriate approach (see (Von Neumann and Morgenstern, 1944; Petrosjan and Zaccour, 2003; Reddy and Zaccour, 2014; Petrosyan and Gromova, 2014)).

For game with 2 players we get:

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

Shi(So, 0) = 2[V(N, So, 0) - V({2}, So, 0) + V({1}, So, 0)];

Sh2(So, 0) = 2[V(N, So, 0) - V({1}, So, 0) + V({2}, So, 0)].

We construct the characteristic function in three ways. The first way was proposed in (Von Neumann and Morgenstern, 1944) and now it is a classical approach for construction of a superaddtivie characteristic function:

0, K = {0}

irat^c ) max min Ji(«i, «2, to, So), K = {i} , .

V"(K, So,to) = ^ ui,ieKuj,j£N\K . (14)

max(J(«i, «2, to, So)), K = N

Ul,U2

By using Pontryadin's maximum principle it is easy to obtain controls, trajectory and payoffs for this maxmin approach. We do not show the analytic formulas for the reason of their bulkiness.

The second way of the characteristic function construction was proposed in the work (Petrosjan and Zaccour, 2003). It is based on the two-step procedure: first we find the Nash equilibrium for all players and then we take the Nash strategies for left-out players from N \ K while players from K maximize their joint payoff:

V5 (K,So,to) =

ui ,jeK Uj = uj NE,jeN\K

0, K = {0} Ji(ui,u2,to, So), K = {¿}

(15)

max(J(ui, «2,to, So)), K = N

«1 ,U2

In general it is not a superadditive characteristic function. The values of the characteristic function for the case of 2-player were obtained above, because here we only need to calculate payoffs in Nash equilibrium and cooperative joint payoff.

The third approach was proposed in (Petrosyan and Gromova, 2014). The characteristic function V(K, So) is constructed by the following way: first we find the optimal controls for all players, then freeze these controls for players from coalition K and then find controls of left-out players from N \ K which give the minimum to

the payoff of players from coalition K. The advantages of this approach in addition to the simplification of the calculation is that the constructed characteristic function is superadditive in general. For the case of 2-player game we have the simple formula for characteristic function:

0, K = {0}

min Jj(ui, u2, to, so), K = {i}

(16)

max(J(mi, u2, t0, S0)), K = N

U1 ,U2

From direct calculations we get

V ({1}, So)

ai ((d + c) p2 + ((c +2 d) A + f) p + dA2) P (P + A)2

+

+ - (p + A)2 (p + 2 A)3 d2 - 2p (p +2 A)3 (pc + Ac + f ) d + p (p + A)2 (p + 2 A)3 + + -p (p + A)2 (4 c2A2 + (4 f c + 4 p c2) A + c2p2 + 2 f2 + 2 p cf ) +

p (p + A)2 (p + 2 A)3

+ (a2 + b + ai) ft M +

ft m b

p (a + p)

+

M ft c

+ Mffif +

+ O / r ■ \ +

(p - A) (J + p) (p + A) (a + p) p2 (a + p)

+

M^i f

ft So

(p + A)2 (J + p) ' J + p

+

and V({2},S0).

The Shapley Value was calculated for all three methods of the characteristic function, but these are not shown here for the reasin of the very big volume of formulas. The analysis of the results will be shown below.

2.5. Imputation distribution procedure

It is important to establish the dynamically stable (time-consistent) cooperative agreement which is the Shapley Value in the framework of this paper. Time-consistency involves the property that, as the cooperation develops cooperating partners are guided by the same optimality principle at each instant of time and hence do not possess incentives to deviate from the previously adopted cooperative behavior. The approach how to avoid the problem of time-inconsistency of the solution by the special imputation distribution procedure was proposed in (Petrosyan, 1977).

We use the same ideology as in (Petrosyan, 1977). Then we suggest to distribute the imputations from Shapley Value by the inputation distribution procedure Bi(t), B2(t) such that

Sh

f><œ P^O

(S0) = e-ptBi (t) dt, Sh2 (S0) = e-(p+A)iB2 (t) dt, Jo jO

e-p(t-T)Bi (t) dt,

e-(p+A)(i-T)B2 (t) dt, T G [0, œ),

Sh2

where {Shj (ST)}i=i,2 is the Shapley value in subgames rc(S(t)) occuring along the optimal cooperative trajectory S(t). Then we get

/T w

e-ptB1 (t) dt + e-pT J e-p(t-T)B1 (t) dt,

/T /»^O

e-(p+A)iB2 (t) dt + e-(p+A)T J e-(p+A)(i-T^2 (t) dt, t G [0, to) .

It means that we can represent the Shapley Value in the game rc(So) in the followin form:

Sh1 (S0) = i e-ptB1 (t) dt + e-pTSh1 (ST) , o

Sh2 (So) = f e-(p+A)iB2 (t) dt + e-(p+A)TSh2 (st) . 0

Thus at the each time instant players are oriented to the same optimality principle (Shapley Value).

By taking the first derivative we obtain the formulas for imputation distribution procedure which are very close to formulas in (Petrosjan and Zaccour, 2003):

B1 (t) = pSh1 (st) - d~Sh1 (st) , (17)

B2 (t) = (p + A)Sh2 (st) - d~Sh2 (st) .

We calculate the imputation distribution procedure for all 3 forms of the characteristic functions but do not put it here for their bulkiness. The results will be analyzed below.

2.6. Analysis of the cooperative solution

Below, a number of plots illustrated the obtained results will be presented. We will compare different approaches to the definition of the characteristic function and consider both cooperative and non-cooperative behaviour of the players.

We see from Figure 4 that the functions have very similar values especially for small values of t. In the Fig. 5 we consider these plots for t > 150.

As shown in Fig. 6, the maximal payoff for the developed country is obtained when using the characteristic function found according to the formula(15) while the minimal payoff corresponds to the value function given by (16).

We see that the functions have similar values. In Figure 7 the plots are zoomed

in.

The plots presented in Fig. 8 and Fig. 9 show that both for the developed and developing country the maximal payoff is obtained for the case of characteristic function defined according to (15) and the minimal payoff corresponds to(16).

It tuns out that that the cooperative behaviour is beneficial both for developed and developing countries.

Fig. 4: Payoff of the developed country when considering non-cooperative solution for different types of characteristic functions

Fig. 5: Payoff of the developed country when considering non-cooperative solution for different types of characteristic functions (t > 150)

Fig. 6: Payoff of the developing country when considering non-cooperative solution for different types of characteristic functions

Fig. 7: Payoff of the developing country when considering non-cooperative solution for different types of characteristic functions (zoomed in)

Fig. 8: Payoff of the developed country for the characteristic function (16) in the cooperative and non-cooperative cases.

Fig. 9: Payoff functions of a developing country for (16)

3. 3-player game

Consider 3-player game, organized by the following way: the first player (player 1) is the same developed country as before, the second player (player 2) is the same developing country as before and the third player (player 3) is develping country identical to player 2. Then we get the following payoffs

Ji (ui, «2, «3, So) = J e ^aiui — ^«1 — AS^ dt,

o

œ

J2(«i, «2, «3, So) = J e-(p+A)r — 2«2 — AAtS^ dr,

o

œ

J3(«i, «2, «3, So) = J e-(p+A)r — 2«3 — AAtS^ dr,

œ

and dynamics of the pollution

S(t) = M («i(t) + «2(t) + «3(t)) — ¿s(t), S(0) = So.

We do not put here details how to calculate optimal controls, cooperative trajectory, payoffs, characteristic function, Shapley value and imputation distribution procedure for cooperative case of the game. We also calculate the Nash equilibrium for non-cooperative game and compare cooperative and non-cooperative behaviour of players by similar but more complicated methods as for 2-player game. Below, we present some numerical results.

According to Fig. 10, for the developed country it is more advantageous to choose the characteristic function (14).

Figure 11 shows that for the developing country the characteristic function (14) gives a bigger payoff either.

Next, as Fig. 12 shows, for the developed country it is more advantageous to choose the characteristic function (16).

For the developing country it is better to choose the characteristic function (15) (see Fig. 13).

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

4444,2

44441

T-1-1-1-1-1-1-<-1-<-1

100 110 120 130 140 130

__t_

masanin ■ ■ 1 max-min

Fig. 10: Comparison of the payoff functions for the non-cooperative case and different types of characteristic functions: Developed country

Fig. 11: Comparison of the payoff functions for the non-cooperative case and different types of characteristic functions: Developing country

Fig. 12: Comparison of payoff functions for different types of characteristic function and the cooperative behaviour: Developed country

332,8 332,6 332,4 332,2 332 331,8 331,6 331,4 331,2

"l-1-1-1-1-1-1-1-1-1-1-1-1-1-1-1-1

20 30 40 JO 60 70 80 90 100

_r_

" ■ maxmin-mas min

Fig. 13: Comparison of payoff functions for different types of characteristic function and the cooperative behaviour: Developing country

4. Appendix 1

We use Pontryagin's maximum principle to find the optimal controls for 2-player cooperative game. The Hamiltonian has a form

H

-pt (aiui - 1 «2 - ßiS^ + e-(p+A)t «2U2 - 1 «2 -

+

+ — p(«i + «2) - <S , (18)

where ^(t) is adjoint variable. From the first-order optimality condition

dH d«i

dH _

d«2

_ (ai - ui)e pt + —p _ 0, (a2 - w2)e-(p+A)i + —p _0,

we get the following formulas for optimal controls:

= «1 + epi-0p,

U2 = «2 + e(p+A)Vp.

Adjoint variable ^(t) can be found from differential equation ^(t) = — , lim ^(t) = 0. Then

— (t) _ e-ptßi + e-(p+A)tß2At +

Finally we get

ß2A

ßie-pt ß2Ae-

-(P+A)t

ß2Ate"

-(p+A)t

<5 + p (J + A + p)2/ < + P (J + A + p)2 < + A + p

where —0 is calculated as follows:

-o _ -

ßi

ß2A

ß2Ato

< + P (< + A + p)2 < + A + p'

Optimal controls have the following form:

_ ßi , ß2Ae-At ß2Ate-At

ui(t) _ ai - p 7--1--2 + 7---

U + p (< + A + p)2 < + A + p

«2(t) _ «2 - p ( eAiT^ +

ß2A

+

ß2At

< + p (< + A + p)2 < + A + p

e

References

Masoudi, N., Zaccour, G. (2013) A differential game of international pollution control with evolving environmental costs. Environment and Development Economics, 18, 680-700. Gromov, D., Gromova, E. V. (2014). Differential games with random duration: a hybrid

systems formulation. Contributions to Game Theory and Management, 7, 104-119. Kostyunin, S., Shevkoplyas, E. (2011). On simplification of integral payoff in differential games with random duration, Vestnik S. Petersburg Univ. Ser. 10. Prikl. Mat. Inform. Prots. Upr., 4, 47-56.

Von Neumann, J., O. Morgenstern (1944). Theory of Games and Economic Behavior.

Princeton University Press, 1953. Petrosjan, L., G. Zaccour (2003). Time-consistent Shapley value allocation of pollution

cost reduction. J. of economic dynamics and control, 27(3). Reddy, P. V., G. Zaccour (2014). A friendly computable characteristic function. GERAD

Research Report G-2014-78. Petrosyan, L., Gromova, E. (2014) Two-level cooperation in coalitional differential games.

Trudy Inst. Mat. i Mekh. UrO RAN 01/2014, 20(3), 193-203. Petrosyan, L. (1977). Consistency of the solutions in differential games with many players. Vestnik Leningrad Univ., 4, 46-52.

i Надоели баннеры? Вы всегда можете отключить рекламу.