Contributions to Game Theory and Management, VIII, 111—136
Strategic Stability of Coalitions Technological Alliance Parameters: a Two-Level Cooperation
Nikolay V. Kolabutin
St.Petersburg State University, Faculty of Applied Mathematics and Control Processes, Bibliotechnaya pl. 2, St.Petersburg, 198504, Russia E-mail: [email protected] URL: ftp: // ftp. isdg. ru
Abstract The paper is devoted to two-level cooperation in differential games. Cooperative differential games are one of the fastest growing parts of the game theory. They are widely used for modeling the conflict-controlled processes in various fields, especially in management and economics. The solution of differential game is a cooperative agreement, and the selected principle of optimality, according to which the received payoff is distributed. The main problem of many cooperative solutions is the instability over time. Studies showed that initially selected cooperative solution often loses its optimality over time. Therefore, the question arose about the stability of the co-operative solutions. The stability can be understood as dynamic stability (time consistency), strategic stability or protection from irrational behavior. The concept of dynamic stability was formalized by L.A. Petrosyan. Cooperative solution is dynamically stable, if the principle of optimality, selected early in the game keeps its consistency throughout the gameplay. For dynamic stability is necessary at each moment of time to carry out the regularization of the chosen principle of optimality. For this regularization L.A. Petrosyan proposed to use the redistribution of received payoff in accordance with the "imputation distribution procedure". Petrosjan (1993) and Petrosjan and Zenkevich (1996) presented a detailed analysis of dynamic stability in cooperative differential games, in which the method of regu-larization was introduced to construct time-consistent solutions. Yeung and Petrosjan (2001) designed time-consistent solutions in differential games and characterized the conditions that the allocation-distribution procedure must satisfy. Petrosjan (2003) employed the regularization method to construct time-consistent bargaining procedures.
The strategic stability of cooperative solution means, that no individual deviation from the cooperation of each member brings benefits the decline member. This means that the outcome of this cooperative agreement is reached at some Nash equilibrium, which will guarantee the strategic support for such cooperation.
Recently in differential games are studied coalitional solutions in which the coalitions act as individual players. Coalitions can play with each other in a non-cooperative game, then payoff of each coalition is distributed among its members in accordance with some principle of optimality. But coalitions-players can cooperate to increase the joint payoff. In this case the joint payoff is distributed between coalitions according to some principle of optimality then coalition's share of joint payoff is distributed between its participants according to maybe other principle of optimality. This cooperation is called two-level cooperation. Optimality principles of payoff distribution between coalitions and within coalition may be different.
To solve such cooperative models which requires at both levels of the cooperation it is necessary to build the characteristic function and imputation
distribution procedure. This paper describes a model of a two-level cooperation in the technological alliance differential game. Participants of the game are the firms with the some technology that brings profit. On the first (lower) level firms form coalitions to increase joint profit. On the second (upper) level coalitions act as individual players and also form the one grand coalition to maximize the joint payoff. The top-level payoff is distributed between coalitions-participants according to some principle of optimality. Thus, each coalition-participants may get more than would receive by playing individually. Then each coalition distributes the its share of joint payoff among its firms-members. This article also presented a stable cooperative solution in this model. For its implementation at every level of cooperation we build the characteristic function and prove its superadditivity. As a principle of optimality the dynamic Shapley value is selected. Proved the dynamic stability and the strategic stability of cooperative soluton. The results are illustrated by a quantitative example.
Keywords: differential game, cooperation, imputation distribution procedure, dynamic stability, strategic stability.
1. Introduction
Consider the cooperative differential game with planned period [to, T]. Participants are firms producing some technology for which they receive profit. N = {1, ...,n} is a set of firms. Parameter of each firm i G N is the level of its technology, which is denoted by x G R+. This parameter is also called firm's state. The game begins from the state of x0 = {xj,..., x^} at the moment t0 and proceeds the period T —t0, for which firms get some profit from its technology. At the moment T firms liquidate their technology and receive additional profit Kostyunin, 2011.
The firm's profit depends on its technological level, so it seeks to increase its, investing in the development of technology. The firm's i level of investment in its technological development is called firm's strategy in the game and denoted for U G R+. This parameter is also called firm's control.
The dynamics of firm's development proceeds according to the differential equation:
where a and S are positive constants. On right side of equation imposed conditions which guarantee the existence, uniqueness and extendibility of solutions for any piecewise continuous controls «¿(s) G R+, s G [t0,T]. Also there is an additional constraint Xi(s) > 0, s G [t0, T].
The profit of firm i G N has the form:
X i (s) = ai [«i(s)xi(s)]1/2 - Sxi(s) Xi(to) = x0, i G N,
(1)
T
(2)
+ exp[-r(T - to)] qi [xi(T)]
1/2
where h (s,xj(s),Mj(s)) = Pj [xj(s)]1'2 — CjMj(s) is profit of firm i at moment s, state xj(s) and control «¿(s); P^cj are positive constants; r is discount rate; qj [xj (T)]1'2 is terminal payoff of firm i at moment T and state xj(T).
Firms can cooperate to increase joint profit. If firms form a coalition, each firm-participant can gain more abilities from other participants. Therefore the technological dynamics of firms in coalition is changed. Consider coalition K, formed by players from a subset of K C N. The dynamics of the firm in coalition takes the form:
Xj(s) = aj [uj(s)xj(s)]1/2 + ^ 6j'j][xj(s)xj(s)]1/2 — ¿Xj(s) (3)
xj(to) = x0, i G K,
where b jj,j] > 0 is positive constant which mean the technology transfer effect from firm j to firm i. The level of technology of each firm in the coalition K has a positive effect on the rate of technological development, which means that the condition [xK(s),uj(s)] /dxj > 0, j G K, where /K [xK(s),uj(s)] is the right side of dynamic equation (3). The coalition payoff is the sum of participants' payoffs.
]THi (x°,T - to,«,) = (4)
ieK
^ J hj (s, Xj(s),Mj(s))exp[-r(s - to)] ds + + ^exp[-r(T - to)] qi [xi(T)]1/2
To maximize profit of coalition K we consider the optimal control problem, which maximizes (4) with the boundary conditions (3). This maximization problem is denoted by w [K; t0; x^].
A detailed solution of this problem is described in (Yeung and Petrosyan, 2006) with using continuously differentiable function of W(io)K (t,xK(t)) : [to, T] x rijeK Rmj ^ R, which determines the max payoff of coalition K C N on the interval [t, T], where t G [to, T]. This function satisfies the Bellman equation:
-Wt(to)K (t, xk(t)) = max I V hj (t, Xj(t), u(t)) exp [-r(t - to)] +
UK Z-'
UeK
]T Wit0)K (t, XK(t)) fK [xk(t),UK(t)] i£K J
W(t0)K (T, xk(T)) = £ exp [-r(T - to)] q [xj(T)]1/2 ieK
/K [xk(t),UK(t)] = {/K [xk(t),uj(t)]} = {xxj}j£K , K С N
пек
The solution of the problem w [K; t0; xK] get function W(t0)K (t,xK(t)) in the following form:
W(to)f (t,xf(t)) =
]TAf (t) [xi(t)]1/2 + Cf(t)
Lief
exp [—r(t - to)], (5)
where values {AK (t)}jeK, CK (t) are solution of the differential equations:
b'
[ij]
AA K (t)= (r + Af (t) - ]T -jrAf (t) - Pi
jeK,j=i
(6)
Cf (t)= rCf (t) 16C-Af (t)
ief
Af (T) = qi, Cf (T) = 0, i G K
In (Yeung and Petrosyan, 2006) has been thoroughly discussed cooperative game where company form a great coalition, maximize joint payoff and share it according to the dynamic Shapley value.
In this paper is an advanced model where members of the gameplay are not individual firms {i}ieN but their coalitions which act as individual players (Petrosyan et al., 2006, Petrosyan et al., 2010).
2. The model of Game
Consider the cooperative differential game rA (x°,T — to) with duration period T — t°, where N = {1,..., n} is a set of firms. Parameter of each firm i G N or its state is the level of its technology xj G R+. Strategy of each firm i G N or its control is level of investment in its technological development u G R+. The game begins from the state of x° = {xj,..., x^ } at the moment t°. Dynamic of firm's technological development proceeds according to differential equation (1). The payoff of each firm calculate according to (2). Firms can form coalitions to increase the joint payoff.
Let A = {Ki, K2,..., Km} - coalition partition of the game, ie Kn p| =
m m
11 = 12, U K; = N, |K;| = n;, £ n; = n. Denote by M = {1, ...,m} the set of i=i ;=i partition indexes.
Introduce some notations:
xKl (s) = {xj(s)}ieKi, 1 G M is a state of coalition K; at moment s G [t°,T], defined by a set of states of its participants; xKj = {x°}ieKi is initial state coalition K;; uKl (s) = {Mi(s)}jeKi, 1 G M is control of coalition K; at the moment s, which is a set of controls of its participants; NV is coalition formed by all elements from partition A; KC C NV is any coalition formed by a subset of partition A elements; yA(t0) = ) ^KK, xk(t),T — tj is the characteristic function of the game.
Dynamics of participants of coalition K; C A proceeds according to the system of differential equations:
xi(s) = ai [ui(s)xi(s)]1/2 bjj'i] [xj(s)xi(s)]1/2 - Jxi(s) (7)
j L j
jefj, j=i
xj(t°) = x°, i G K;
Denote the right side of the equation by /Ki [xK, (s), uK, (s)]
The payoff of coalition K; is equal to the sum of participants' payoffs:
Hk (xK,,T — t°,UKi) = E Hi (x°,T — t°,Uj) = (8)
ieKi
T
^ / hj (s, xi(s),Ui(s))exp[—r(s — t°)] ds +
ieK< to
+ Eexp[—r(T — t°)] qi [xj(T)]1/2 ,
ieK;
where h (s,xj(s),Mj(s)) = Pj [x»(s)]1/2 — cjui(s) is profit of firm i at moment s G [t°,T].
3. Cooperation of coalitions
Coalitions K; C A can cooperate to raise the joint profit. This joint profit coalitions share between themselves in accordance with a some optimality principle. Consider the coalition K = K;i |J Kl2 |J .„UK;k, where K;i, K;2,..., K;k C A. The evolution of technological level of firm from K; C K satisfy the following system:
xj(s) = a [Uj(s)xj(s)]1/2 + E b^x;(s)xj(s)]1/2 + (9)
jeKji
+ E t^j(s)xi(s)]1/2 + ... + E 6ij'i][xj(s)xj(s)]1/2 + ... +
jeKi2 jeKj, j=i
+ E 6ij'j][xj (s)xj(s)]1/2 — ¿xj(s)
xj(t°) = x°, i G K; C it C N Sums ^ 6j>] [xj (s)x» (s)]1/2 represent the total effect of transfer of technology
to the firm i from the corresponding coalition. Thus, a synergistic effect of technological development of the firm i is obtained as by members of the coalition, which it originally owned, and by members of other coalitions from coalition K Simplify an expression (9):
xj(s) = a [Uj(s)xj(s)]1/2 + E 6?'j][xj(s)xj(s)]1/2 — ¿xj(s) (10)
jeK, j=j
xj(t°) = x°, i G K C N
The payoff of coalition K is its profit, which, as before, is calculated as the sum of the profits of its participants K^, Kl2,..., Klk :
k
HK (xK,T - ^«f) = (xKi£ ,T - t0,uKiJ = (11)
«=1
k / T
= ^ / hi (s, xi(s), «¿(s)) exp [—r(s - to)] ds +
+ exp [—r(T - to)] qi [xi(T)]
¿eKj,
1/2
Coalition if = formed by all participants of the game r^ (x0, T — to), will be called the the coalitions technological alliance.
To find the the solution of the coalition in the game rA (x°, T — to) is required
to find characteristic function V4(t<c) = V4(tci) ^K, xk(t),T — tj and define the imputation distribution procedure.
4. The calculation of the characteristic function VA(to) (k, xK(t), T - t) of the game rA fx0, T - t0)
Note that participants of the game rA (x0, T — to) are not the individual firm but their coalitions, therefore it is necessary to consider only subsets of K; C A and their associations.
Since in this model the formation of coalitions leads only to a change in the dynamics of the game, and formed coalitions did not interact with each other, therefore any firm j, is not a member of the coalition of K;, does not affect its development. In this case the characteristic function V4(lc) ^K, x^(t),T — tj is equal max payoff of coalition if.
To calculate the characteristic function V) ^K, x^(t),T — tj the following problem of maximization has to be solved:
VA(t°) = V) fa,xKK(t),T — t) = max (HK (xK(t),T — t,uK(t))) = V / «jf
m^ax H^i (xK; (t),T — t,MKi (t))
^ \KiCif ,
/ T
max I ^^ I ^^ hj (s,xi(s),Wj(s))exp[—r(s — to)] ds+
\KicK t ieKi
]T ]Texp[—r(T — to)] qi [xi(T)]
Kicif ieKi
1/2
W(i°)f (t, xf (t)) - continuously differentiable function, which determines the maximum payoff of coalition K in the time interval [t, T], t G [to, T].
Dynamics of development of coalition K participants proceed in accordance system (10).
The problem of maximizing the coalition payoff was described in (Yeung and Petrosyan, 2006). The function W(t°)f (t, xf (t)) satisfies the Bellman equation:
-Vf(t0) fK,xf (t),T - t[ = (12)
max < y^ hi (t, xi(t), u(t)) exp [-r(t - to)] +
, ief
+ E ) (itxf (t),T - t/ [xf (t),uf (t)]
ffCf
V?(t0) (K,xf (T),t) = E exp [-r(T - to)] qi [xi(T)]1/2
,x
fl
flCf
t0)
J, ^ j —
ief
Here:
Vfl(t0) (it,xf (t),T - t) = gradVfl(i°) (it,xf (t),T - t)
= { VX^(i0) (i^ (t),T - t)}
ief i
/£ [x*(t),Ufi (t)] = {/* [xff (t),Ui(t)] }iefi = {xi(t)}iefi Ki C K
Expression V?(t0) ÎK, x^(t), T - t j [x^(t), ufl (t)] can be written as amount:
0)
ie fi
Substituting (13) in (12), get:
Vf?'0 ) (i^ ^ (t),T - t)/| [x ff (t),ufi (t)] = (13)
E Vx?(t0) (it,xff (t),T - t) /* [xff (t),Ui(t)]
-Vt?(i0) (if ,xk(t),T - t) = (14)
max < E hi (t, xi(t), u(t)) exp [-r(t - to)] +
ie Kf 0)
+ E VX?(t0) (K,x*(t),T - t) /f [xff (t),Ui(t)]
ie Kf
V?(t0) (it,x*(T),t) = E exp [-r(T - to)] qi [xi(T)]1/2
ie Kf
Taking the partial derivatives of the values {wi}ief from the expression under the sign of max, and equating them to zero, obtain the formula for the optimal controls:
VX4(t0) (K,xf(t),T - t) exp [r(t - io)jl xi, « G it (15)
4(ci)2
Substituting (15) in (14) and solving the resulting equation, get:
V) (it,xf(t),T - t)
]TAf (t) [xi (t)]1/2 + Cf (t)
¿ef
exp [—r(t - to)] (16)
Values |ak (t) j ^ „ and CK (t) are solutions of differential equations:
Af (t)= (r + 0 Af (t) - £ Af (t) - Pi
jef ,j=i
Cf (t) = rCf (t) 16c-Af (t),
¿ef -
Af (T) = qj, Cf (T) = 0, i G if
Thus, the characteristic function of V4(io) (k, xf (t), T — tj is equal to the Bellman function W(t°)f (t, xf (t)). Optimal control of the coalition if have the form:
uf (t) = K(t)}-
¿ef
16(Ci)2
Af (t)
¿ef
Dynamics of coalition development takes the form:
(17)
xj(s) = 4jAf (s)[xj(s)]1/2 + ^ 6f][x*(sK(s)]1/2 — ¿xj(s) (18)
j ¿ef
xj(to) = x0, i G if, s G [to,T] Thus the characteristic function V4(io) fif, xf (t), T — tj is defined as follows:
0 K = 0
V4(i0) (K, xf (t), T - t) = <( W(t0)fl (t, xf (t)) K = K, c A (19)
W(t0)ff (t,xf (t)) K = it Ç N,
2
a
u =
a2
2
where W(to)K (t,xK(t)) is defined by (5)
Establish superadditivity of function V4(io) (K, (t), T - t). Given (19) if the function W(to)K (t, (t)) is superadditive, then superadditivity of function V De"°(io) (K, (t), T - t) is obvious.
Firstly give without proof theorem about the comparison of solutions (Chaply-
Theorem 1 (about the comparison of solutions). Given two Cauchy problem:
For each problem the conditions of existence and uniqueness solutions is maintained, and furthermore the condition:
Assume yo > Then for all t > to the following condition is maintained:
Proof the superadditivity of function W(to)K (t,xK (t)).
Theorem 2 (about superadditivity of function). Characteristic function W(io)K (t,XK(t)) defined by (5) is superadditive.
Proof. Consider two coalitions Si, S2 C N, Si n S2 = 0 and their union Si U S2. For each of these coalitions function W(to)K (t,xK(t)) takes the following form::
gin, 1950)
yi(t) = /i (t,yi(t)), yi(to) =
2/2 (t) = /2 (t, /2 (t)), /2 (to) = y0
fi (t,u(t)) > /2 (t,u(t)), V (t,u(t))
yi(t,to,/0) > /2(t, to, y2 )
W(io)SlUS2 (t,xSlUS2(t))
W(to)Sl (t,xSl (t))
W(tc)S2 (t,xs2(t)) =
T
= m ax y^ / Pi [xi(s)]1'2 — ci«i(s) exp [—r(s — to)] ds+
"Si
1 Vef't
+ ^exp[—r(T — to)] qj [xj(T)]
ieSi
The equations of dynamics for coalitions have the form:
1/2
xi (s) = /flUS2 [s,xSiUS2 (s),«i(s)] = ai [«i(s)xi (s)]1/2 + (20)
[xj (
+ E 6ij'i][xj (s)xi (s)]1/2 — Jxi (s)
jeSiUS2, j=i
xi(to) = xo, i G S1 U S2, s G [to, T]
xi (s) = /fi [s, xsi (s),«i(s)] = ai [«i(s)xi(s)]1/2 + (21)
+ E 6ij'i][xj (s)xi (s)]1/2 — ¿xi (s) jeSi, j=i
xi (to) = xo, i G S1, s G [to, T]
xi (s) = /S2 [s,xs2 (s),ui(s)] = ai [«i(s)xi(s)]1/2 + (22)
bjj,i] [xj (
+ E 6ij'i][xj (s)xi (s)]1/2 — ¿xi (s)
jeS2, j=j
xj(to) = xo, i G S2,s G [to, T]
From condition xj(s) > 0 for Vi G N obtained, that for any uj:
/SiUS2 [s, xsiUS2 (s),«i(s)] > /iSi [s,xsi (s),«i(s)], i G S1 /SiUS2 [s,xsiUS2(s),«i(s)] > /f2 [s,xs2 (s),«i(s)], i G S2
Let solutions of equations(20),(21) and (22) respectively jxfiUf2(s)|
xSi (s)}iesi an^xf2(s)}ieS2. Given theorem 1, for any valid «¿(s) the level of firm's technology xfiUf2 (s) > xf (s), i G S1 and xf US2(s) > xf2(s), i G S2.
Given the firm's payoff formula (2) obtain that for any valid «¿(s) payoff of firm Hi (xi(t), T — t, ui) in coalition S1 U S2 is greater than payoff of the same firm in the coalition S1 or S2.
iefiUf2
Hi (xf1Uf2 (t),T - t,ui(t)) > Hi (xf1 (t),T - t,ui(t^ i G Si Hi (xflUS2 (t),T - t,ui(t)) > Hi (xf2 (t),T - t,ui(t^ i G S2 Summing firms' payoffs for coalitions, get
]THi (xflUf2 (t),T - t,Ui(t}) >E Hi (xf1 (t),T - t,ui(t)) (23)
ieSi iefi
E Hi (xflUf2 (t),T - t,ui(t}) > E Hi (xf2 (t),T - t,ui(t})
ief2 ief2
Denote by \ uf 1 (sH and < uf2 (sH controls, maximizing payoffs' sum in L J ies1 L J ies2 coalitions S1 and S2. Substituting them in (23), get:
EHi (xf1 Uf2 (t),T - t,uf (t)) > E Hi (xf1 (t),T - t,uf (t)) (24) iefi i£f1
E Hi (xf1Uf2 (t), T - t, uf2 (t)) > E Hi (xf2 (t), T - t, uf2 (t))
ief2 ief2
Adding the inequality (24), get:
E Hi (xf1Uf2 (t),T - t,uf (t)) + E Hi (xf1Uf2 (t),T - t,uf2 (t)) >
ief1 ief2
> E Hi (xf1 (t), T - t, uf1 (t)) + E Hi (xf2 (t), T - t, uf2 (t))
ief1 ie»2
Denote by < uf1UfM controls, maximizing summs of payoffs in coali-
tion S1 U S2. Then:
E Hi (xflUf2 (t),T - t,uflUf2 (t)) = (25)
f 1 Uf 2
{ E Hi (xf1 Uf2 (t), T - t,ui(t))l > UefiUf2 )
> E Hi (xflUf2 (t), T - t,uf (t)) + E Hi (xflUf2 (t), T - t,uf2 (t)) >
iefi ief2
> E Hi (xfl (t), T - t, ufl (t)) + E Hi (xf2 (t), T - t, uf2 (t)) =
iefl ief2
= maJ VHi (xf (t),T - t,Ui(t)H +
Mi'iefl litfl V 1J
+ ma^ Hi (xf2 (t),T - t,ui(t)) I
«i,ief2 — v /
Uef2 J
max
м¿,ies1us2
By definition:
max
u ,iefiUf2
E Hi (xfiUf2 (t), T — t,ui(t)) I = W(t0)fiUf2 (t,xsiUf2 (t)) (26) ,iefiUf2 J
"inaf J E Hi (x,fi (t),T — t,ui(t)) | = W(t0)fi (t,xsi (t))
ui,iefi l^iefi J
max J V Hi fxf2(t),T — t,«i(t)) I = W(t0)f2 (t,xf2 (t))
Ui'ief2 lits2 v J J
Substituting (26) in (25) obtain:
W(t0)fiUf2 (t, xfiUf2 (t)) > W(t0)fi (t, xsi (t)) + W(t0)f2 (t, xf2 (t)), as required.
5. The imputation distribution procedure in the coalitions technological alliance
Suppose that the participants of coalitions technological alliance share joint payoff in accordance with the dynamic Shapley value (Petrosyan and Zaccour, 2003). Note calculated the share of coalition i C 4. The formula for the components of the Shapley value takes the following form:
VKi (V) = E (k — 1)!(m — k)! [v(it) — V(it \ K;)] (27)
z—' m! L J
K CNN
where it = K;i y Kl2 y ... U Klfc is subset of coalitions from partition 4, K;£, £ = 1,..., k, k is number of coalitions included in coalition i
To maximize revenue of technological alliance, the players on the interval [to, T] will use a set of controls defined by the formula (17), and implement the optimal cooperative trajectory (18) in the case of k = NV. It is anticipated that the division of the joint income of the players will use the Shapley value, the components of which are calculated by the formula (27). At the initial time to, the coalition's K; share of joint payoff is equal to:
vKr (to,xN) = E
(k — 1)!(m — k)!
V4(t° ) (i,xK,T — to) — (28)
K CNV
— V^ (it \ K;,x°K\k , T — to)
Given that V4(to) (it,xK(t),T — tj = W(to)K (t,xK(t)), expression (28) can be rewritten:
m!
^ (to, xN) = £ (k - 1)mm - k)! k™K (to, xKk) - (29)
K Cd
- W(to)K\Kl (to,x°f\K.)
The Shapley value has to be maintained throughout the game. Therefore, in every moment the next condition has to be maintained:
vK) (t,xN (t))= z (k - i)mm- k)! [W (to)K (t,xjK (t)) -
- W (to)KK\Ki (t,xt \k, (t));
For realization the dynamic Shapley is necessary to determine the imputation distribution procedure (Petrosyan, 1977), to compensate transient changes. Define the imputation distribution procedure as a function of B^(t) = (t)}t=to, such that:
(to, xN) = J BKl (s) exp [-r(s - to)] ds + (30)
to
+ exp [-r(T - to)] E qi [x!(T)]1/2
ieKi
The function BKl (t) is a payment received by a coalition K; at the moment t after redistribution of joint payoff. For dynamic stability the next equality has to be maintained at each moment:
T
vKO) (t,xN (t))=/ BKl (s) exp [-r(s - to)] ds + (31)
(to ) U „*
l
t
+ exp [-r(T - to)] E qi [x!(T)]1/2
ieKi
From (30) and (31):
vKO) (to,xN) =J BKl (s) exp [-r(s - to)] ds + (t, xN(t)) (32)
to
This condition means temporary solvency or a dynamic stability (Petrosyan, 1977, Zenkevich and Petrosyan, 2007) decision on coalition parties {K;}. But it is also necessary to show the dynamic stability of the solution with respect to each individual firm. This will be done below.
t
Note that in every moment of s G [to, T] occurs only redistribution of the joint profits, but the sum of players payoffs does not changes:
]T Bk (s) = E Ehi M (s),< (s)) =
Ki C4 KtCAieKi
= EE [Pi [x.-(s)]1'2 - <W (s)
KiC^ieKi
In this case the function BKi (s) has the form:
Bki (s) = E
(k — 1)!(m — k)!
Wt(s)KV (s, x*v(s)) — W/s)KV\Kl
s,xK\k (s)y
+
K Cd
+ E [W(s)KV (s, x*K (s))j /f [xf (s),«j(s)] — jeK
— E [Wi
heK\k
s)K\K : s *
s,xK\K (s) fh [xf (s),uh(s)]
m!
6. The payoff distribution within the coalition Ki
Coalition's K; share of joint payoff has to be distributed between firms-participants. Suppose that firms act cooperatively within the coalition. Define the cooperative game rKi (x°, T - t°), where K; is the set of [layers, and V(to)Ki (L, xL(t), T - t), L C K; is characteristic function, calculated on the assumption, that firms, not included in K;, use their optimal coalition strategies, and firms, not included in L use their optimal individual strategies in game rKi (x°, T -t°).
To calculate the payoff share of firm i G K; is necessary to calculate the value of the characteristic function in the game rKi (x°,T -t°) and determine the imputation distribution procedure.
The characteristic function will be calculated as follows. First, we calculate the value of the characteristic function for the coalition K;, then for any coalition L C K;.
In calculating the characteristic function for the coalition Kr necessary to consider that it participates in the game of coalitions rA (x°, T - t°). Therefore it gets more payoff than could be obtained by playing individually. Since any subcoalition K C K; is not included in the partition 4, assume that it does not have those bonuses, which are available to the coalition K;, and its characteristic function will be calculated without considering of coalitions game.
The value of the characteristic function VKi(i°) (K;,xKi (t),T - t) should be equal to the maximum payoff that coalition K; can receive. If coalition played alone, its maximum payoff would be equal to the function W(t°)Ki (t, xKi (t)), defined formula (19). This function determines the maximum winning coalition K; in the case of individual development of the game of coalitions. But since the coalition combined into a technological alliance, the result of profit distribution in the upper level, each coalition K; gets a share of revenue equal to component of the Shapley value vK^ (t, xN(t)), which are calculated according to the formula (29).
Due to individual rationality v^ (t,xN(t)) > W(to)Kl (t,xK;(t)), payoff of coalition is higher than in individual development. Therefore, the characteristic function V(to) (i);,x|fi (t),T -t) is equal to the payoff of coalition K; in the game rA (x0, T - to), ie component of the Shapley value:
VKl(to) (K;,xKi (t),T -1) = v(Ko) (t,xN(t))
Calculate the characteristic function of any coalition L C K;. The function V(to) (L,xL(t),T - t) is found by solving the following optimization problem:
V(to) (L, xL(t), T - t) = max (HL (xL(t),T - t,uL(t))) =
= mx £ Hi (xi(t),T - t,ui(t)) j = "L VieL /
= max I y^ J hi (s,xi(s),ui(s))exp[-r(s - to)] + "L \ieL t
+ Eexp [-r(T - to)] qi [xi(T)] 1/2 ) , L C K;
ieL /
Function VKl(to) (L,xL(t),T - t) satisfies the Bellman equation:
-VtKl(to) (L,xl(t),T -1)
= max < hi (t, xi(t), ui(t)) exp [-r(t - to)] +
UL I ieL
+ E VxK1 (to) (L, xL(t), T - t) fL [xL(t), ui(t)] |
ieL J
VKl(to) (L, xN, T) = $>xp [-r(T - to)] qi [xi(T)]1/2 ieL
hi (t,xi (t),ui(t)) = [Pi[xi(t)]1/2 - Ciui(t)
fL [xl(t),ui(t)] = a [ui(s)xi(s)]1/2 + £ 6jj'i][xj(s)xi(s)]1/2 - ¿xi(s) Using a similar technique to that described above, receive:
V(to) (L,xL(t),T -1) =
]TAL(t) [xi(t)]1/2 + CL(t)
ie L
exp [-r(t - to)]
Values AL(t) and CL(t) are solutions of differential equations:
(t)= (r + 2 AL(t) - £ j-A^t) - Pi
CL(t) = rC1 (t) ^Af (t)
-eL -
AL(T)= qi, CL(T) = 0, i e L C K; From the equations above is formula of partial derivatives:
VKi(io) (L,xL(t),T -1)= ^
(t) [x-(t)]1/2 + cL(t)
]TAf(t) [x-(t)]1/2 + CL(t)
aL
-eL
V2 ^ rri
.-eL
exp [ r(t - to)]
VKi(io) (L, xL(t), T - t) = ^AL(t) [x-(t)]-1/2 exp [-r(t - to)]
It is easy to verify that the characteristic function VKi(io) (L, xL(t),T -1) coincides in this case with the function W(to)L (t,xL (t)), which determines the maximum payoff of coalition L in game of firms (Yeung and Petrosyan, 2006). Thus, the characteristic function of the game PKl (x°, T - to) has the following form:
r
0 K = 0
vKi(io) (K,xK(t),t -1) = J ))(t;(t)) K = K; (33)
v ' KW' ; ^ w(to)- (t,x-(t)) k = {¿}e K v ;
i
W(to)L (t,xL(t)) K = L c K;
The superadditivity of function P(xo, T - to) obviously follows from superad-ditivity of function W(to )L (t, xL (t)) (Theorem 2) and the condition vK°} (t, xN (t)) >
W(to)Ki (t,xK; (t)).
Introduce the imputation distribution procedure in the game. Assume that firms share the payoff of coalition K; according to the dynamic Shapley value v(to)Ki (to,xKJ = |vi(to)Ki (to,xKJ} . again. Since coalition K; involved in
the game of coalitions P^ (xo,T - to), then its participants maximize joint payoff W(to)N (to,xN), using a set of optimal controls ju*(t)}i£N, obtained by the formula (17) on the interval [to, T] and implement corresponding optimal trajectory for the case it = NV.
At the initial moment to, the share of firm's i e K; cooperative profit is equal
to:
,(to)Ki
(to,xKi)
(k — 1)!(k, — k)!
KCK,
k,!
VKi(to) (K,xK, T — to) —
— VKl(to) (k \ i,xK\i,T — to
where k; = |K; | is number of coalition members K;.
The Shapley value has to be maintained throughout the game At moment t G [t°,T] and state xKi (t) G xN(t) for firm i G K; has to be maintaned the following principle:
v(t0 )Ki (t,xK, w) = E
(k — 1)!(k, — k)!
KCK,
k,!
VKl(t0) (K,xK(t),T — t) —
— VK(to) (k \ i,xK(t), T — t)
Given (33), the formula for the components of the Shapley value can be rewritten as follows:
,(to)K,
(t,xK, (t)) = E
KCK,
(k — 1)!(k, — k)!
k, !
W(to)K (t,xK(t)) —
— W(to)K ft,xK\i(t)
+
+
k,
vKto) (t,xK, (t^ — W(to)Kl\4 (t,xK,\i(t)
For realization the dynamic Shapley is necessary at any moment to make a redistribution of the joint payoff. Define the imputation distribution procedure for
allocating sharing (Petrosyan, 1977), as a function of BKi (t) = jBKi (t) j , such
that:
(t0)K,
(to,xK,) = J BK1 (s) exp [—r(s — to)] ds + t0
+ exp [—r(T — to)] qi [x*(T)]1/2
(34)
Function Bj 1 (s) is an payment received by firm i G K, at the moment s G [to, T]. For dynamic stability of solution the next equality has to be maintained at any moment t G [to, T]:
v(to)Kl (t,xK, (t)) =| BK1 (s) exp [—r(s — to)] ds + t
+ exp[—r(T — to)] qi [x*(T)]1/2
(35)
1
From (34) and (35) receive: t
vf°)Ki (t°,xKi r = / BiKi (s) exp [-r(s - t°)] ds + v(t0)Ki (t,x^ (t)r
to
Function BKi (t) is determined from the derivative component of the Shapley value v(t°)Ki (t°xKJ. Note, that characteristic function VKi(to) (K;, xKi (t), T - t) is equal component of Shapley value vK°) (t, xN(t)) in the game of coalitions rA (x°,T - t°), which depends from the states of all participants of the partition 4. Consequently, partial derivatives of the components v(t°)Ki (t, xKi (t)) for firms' j G K; states are not zero.
In the general case BKi (s) has the form:
BK (s) = E
KCK,
(k — 1)!(k, — k)! k,!
Wt(s)K (s,xK(s)) — (36)
Wt
(s)K\i
s, xK\i(s
(s) + E Wis)K (s,xK(s)) /f [xf (s),uj(s)] —
jeK
— Wi(
heK\i 1 k,
s,xK\i(sM /f [xf (s),uh(s)0 +
+ k i Ko^ t (s,xf (s)) — Wt(s)K,\i (s,xK,\i(s^ +
+ E
(to)
K,
(s,xf (s)) /f [xf (s),js)] —
jef
— Wi(
hef \i
(s)K,\i
s,xK,\i(s^ fh [xf(s),uh(s)]
7. Dynamic stability of coalitions technological alliance
To prove the dynamic stability of the coalition built solution necessary to show that at each moment there is only a redistribution of payoff between all firms i G N, and the total sum of profit remains unchanged, ie it is necessary to prove:
h
i
j
h
E EBf (s)= E Ehi (s,x?(s),<(s))= (37)
KicdieKi KicdieKi
= E E [Pi [xi(s)]1'2 — ciui(s)
K,cdieK,
It has already been established that at any time there is a redistribution of profits between coalitions participating game rA (xo,T — to). Consequently, equality (37) can be rewritten as:
E EBKi (s)= E Bki (s) Ki c^-eKi k c4
To prove this equality it's enough to show that ^ Bi 1 (s) = BKl (s) for any
coalition-participant K; C
Summing component (s), is easy to check, that:
EBK (s) = -( [vKo ^ t (s,xN (s)) +
-eKi
+ E [v(KoT x (s,xN (s)) /f K (s),u-(s)]
j£N
The right side of this equation is the total derivative of v^ ) (t, xN (t)) for t with the opposite sign, which by definition equal BKl (s). Thus, the resulting coalition solution is dynamically stable.
8. Strategic stability of coalitions technological alliance
Show the strategic stability of cooperative solution. Recall that the cooperative solution is strategically stable if no individual deviation from the cooperation of each member brings benefits the decline member.
Since members of a cooperative game (xN, T - to) is not individual firms but coalitions {K;};eM, consider that a individual firm i e K; can not itself get out of the cooperation N Only the whole coalition K; could.
Formulate a theorem on the existence of e-Nash-equilibrium in the game r/ (xN, T - to).
Theorem 3. In the game r^ (xN, T - to) for any e > 0, there is e-Nash-equilibrium with payoffs equal components of share jv^'' (to,xN)| •
Proof. The proof is done by constructing an optimal control at the class of piecewise program strategies. Consider a family associated with the game Ty (x, T - t) games rKj N\Kl (x, T - t) from the initial state x = {xi}ieN duration T-1, where coalition K; is played individually, and the other coalitions-participants united in a coalition of N\K;. Since n(x, T - t) is not a zero-sum game between the K; and N\K; , the maximum payoff of coalition K; in the game rKijN\Kl (x, T - t) is determined by the function W(to)Kl (t,x^ (t)).
Let iiA\Kl (x, t; •) - this is N\K; is optimal piecewise program strategy of the coalition N\K; in the game rKl ,N\Kl (x, Tt). For ease of description denote it N_Ki by optimal strategy. This strategy leads to the maximization of the coalition win N\K;.
It is logical to assume that in case of deviation the coalition K; from the optimal control of the cooperative «K (s) e «N(s) and cooperative trajectory
x^ (s) G xN(s) it stops receiving the bonuses of which gave the alliance and continues to evolve independently. But since piecewise program strategies used, it is not happening at once but when switching jt^} , £ = 0,1,..., 0 — 1.
Let X(t) = jxKl (t),..., xKm (t)} is a segment of admissible trajectory of players, defined on the interval [to, t], t G [t0, T]. For each K; G A = {Ki,... , Km} define the values: t(l) = sup {t; : xKl (t;) = x^ (t;)} and t(h) = min jt(l) = t(h)}.
Here t(h) belongs to one of the intervals [t^, t^+1], £ = 0,1,..., 0 — 1. Thus, the interval [t0, t(l)] is the period where xKl (t) = x^ (t), and [t0, t(h)] is period, in which x(t) = xN(t). Note that in this case the value of xKl (t) is a set of trajectories of firms from coalition K;, ie xKl (t) = jx.j(T)}ieKl. Therefore, the expression xKl (t) = x^ (t) means that x^(t) = x*(t) for all i G K;.
Define the following strategies for the coalition K;, in condition t^ < t(h) < t^+1:
((t) x(t) = xn(t), T G [to,t«] «* (t ) =\ (x(t5+i),t5+i), t G (t5 ,t5+i] (38)
[VuKi(t), t G (t5+i,T]
Prove that situation «*(•) = {«K(•)}K where the strategy defined by the formula (38) will be e-Nash-equilibrium in the game Ty (xN, T —10).
If all the coalition-participants do not deviate from the conditionally optimal cooperative trajectory throughout the game, the payoff of coalition K; is equal to the corresponding component of the Shapley value:
Hkj (xK; ,T — t0,uK) = / BKl (s) exp [—r(s — t0)] ds + (39)
J to
+ exp [—r(T — t0)] E ® (xi(T))i/2 = v(K0)(t0,xN)
ieKj
Let coalition K; deviated from the cooperative trajectory at the moment t G [t^_i,t^], changing its strategy on uKl. Show that:
Hk (xKl,T — t0,uKj > HK(xKl,T — t0,UKi) — e, (40)
for all K; C A and any strategy uKl. Assume that the resulting trajectory of x(t) is different from the path of xN(t). If from the moment t^, coalition {A \ K;} will use its N-Kl-optimal strategy for U^\Kl (x,t; •), then, from this moment, max payoff of coalition K;, with discount will be:
Hk, (xk, (tç), T - tç, «K ) = exp [-r(tç - to)] *
*(E/ hi (s,xi (s),ui (s))exp[-r(s - tç )] ds • + VieKj ^
+ exp [-r(T - tç)] E ® (xi(T))1/2) =
¿EK, /
i t
= E / hi (s,xi (s),«i (s)) exp [-r(s - to)] ds + -'if
T
'. ; , xi (s), «i
+ exp[-r(T - to)]£ qi (xi(T))1/2 ieKj
Then its total payoff will be:
Hk; (xK, ,T - to,«K,) = / Bk; (s) exp [-r(s - to)] ds + (41)
J to fk '
/ BK, (s, x(s)) exp [-r(s - to)] ds +
f T
E / hi (s, xi(s),«i(s))exp[-r(s - to)] ds + i£K^ t£
exp [-r(T - to)] E qi (xi(T))1/2 ,
i£K,
where the function BK, (s, x(s)) is the payment received by a coalition K; at moment s e [t,t^) and calculated as result of the imputation distribution procedure where the trajectory xN(s) is replaced by the path x(s). Given that
C t
v(Ko) (to,xN) = / Bk, (s) exp [-r(s - to)] ds + v^ (t, xN(t)), lt0l
and besides
E / hi (s,xi(s),«i(s)) exp [-r(s - to)] ds + + exp [-r(T - to)] £ qi (xi(T))1/2 = W(i0)Kl (tç, xK, (tç)),
-eKi
the payoff of coalition K; is equal:
Hk, (xK,, T - to, UK,) = vK0^(to, xN) - vK0}(t, xN(t)) + (42)
r if
+ J BK, (s, x(s)) exp [-r(s - to)] ds + W(i»)Ki (tç, xk, (tç))
Since t G [t^_i,t^], then consequently
v(K0) (t,xN(t)) > v(K0) (t5,xn(t5))
and
rt£ rt£
/ BKl (s,x(s))exp [—r(s — t0)] ds < / BKl (s, x(s)) exp [—r(s — t0)] ds
Jt Jt^-1
Therefore, the coalition's K; payoff are not greater than
HKl (xKl, T — t0,UKl) < vK0) (t0,xN) — vK0) (t,xN(t))+ (43)
r H
+ / BKl (s, x(s)) exp [—r(s — t0)] ds + W(to)Kl (t5, x^ (t5)) Jt£-1
Choosing a value S > 0 (t^ — = S) small enough can be ensure that the integral Jtf£ i BKl (s, x(s)) exp [—r(s — t0)] ds was less than e/2. Then the following inequality is maintained:
Hki (xKl, T — t0,UKl) < vK0) (t0,xN) — vK0) (t,xN(t))+ (44)
r t£
+ / BKl (s, x(s)) exp [—r (s — t0)] ds + W(to)Kl (t5, x^ (t5)) < Jt£-1
< v(K0) (t0, xN) — v(K0) (t, xN(t)) + W(to)Kl (t5, xki (t5)) + e/2 It is incorrect to argue that
vK0) (t,xN(t)) > W(to)Kl (t5,xki (t5)),
because the state xKl (t^) is not included in cooperative trajectory xN. But because the dynamic Shapley value has the property of individual rationality, ie
vKo) (t,xN(t)) > W(to)Kl (t5,xKl (t5)) ,
then choosing the value S > 0 (t^ — = S) small enough can ensure that v(K0) (t,xN(t)) > W(to)Kl (t5,xki (t5)) — e/2,
Therefore
vK0) (t0,xN) — vK0) (t,xN(t))+ (45)
+W(to)Kl (t5, xki (t5)) + e/2 < v(K0) (t0, xN) + e
Strategic Stability of Coalitions Technological Alliance Parameters As a result obtain the inequality:
Hk1 (x°Kl ,T - to,UK,) < vg) (to,x°N) + £ = = Hk (xK,, T - to, u*k1 ) + £,
as required.
The strategic stability within the coalition proved similarly. Thus the strategic stability of the coalition solution is proved.
9. Quantitative example
Illustrate numerical results on the example of the three firms. On the set of firms N = {1, 2, 3} defined partition A = {{1, 2}, {3}} consisting of two coalitions. Given initial parameters: t0 = 0; T = 20; r = 0,1; 6 = 0, 2; P1 = 0, 6; P2 = 0, 3; P3 = 0,15; a, = 0, 5; ai = 0, 3; bfi] = 0, 05; q = 0,1 i, j =e N
Figure 1 shows graphics of coalitions states for a game duration in the case of individual develop]
Fig. 1: Graphics of coalitions states in the case of individual development
On the figure 2 here are graphics of coalitions states for a game duration in the case of cooperation:
On the figures 3 and 28 here are graphics of coalitions profit before redistribution (hij2(t) = hi(t) + h2(t) and h3(t)) and after redistribution (Bij2(t) and B{3}(t)).
{12}
Figure 5 illustrate profit's shares of firms 1 and 2 inside coalition {1, 2} (B{ ' } (t)
and B2{1 '2}(t))
ft ' i, ' 10 15 30
Fig. 2: graphics of coalitions states in the case of technological alliance
5 10 15 29
Fig. 3: Graphics of coalitions profit before redistribution
Table 5.3. shows the values of coalitions payoffs in randomly selected time points before and after the payoff redistribution. The values of profits before and after redistribution are different, but the sum does not change.
Table 1: Values of coalitions payoffs before and after redistribution
t h{1,2} (t) h{3}(t) h{i,2} (t)+ h{3} (t) B{1,2}(t) B{3} (t) B{l,2}(t)+ B{3}(t)
0 -0,04031 0,02488 -0,01543 -0,07842 0,06299 -0,01543
5 0,85579 0,09914 0,95493 0,81135 0,14358 0,95493
10 1,51064 0,16949 1,68013 1,46475 0,21538 1,68013
Table 2 shows the value of profits of firms 1 and 2 in the coalition {1, 2} in a randomly selected time points. At each point equal the sum of the profits of firms profit coalition.
Table 2: Values of firms payoffs after payoff redistribution
t B{1'2} (t) B2 B{1,2} (t) + B{1,2}(t) B{1,2} (t)
0 -0,12022 0,04180 -0,07842 -0,07842
5 0,55274 0,25861 0,81135 0,81135
10 1,03134 0,43341 1,46475 1,46475
Thus the constructed cooperative solution is dynamically stable.
References
Bellman, R. (1957). Dynamic Programming. Princeton University Press: Princeton, NJ.
Chaplygin, S. A. (1950). A new method of approximate integration of differential equations. M.-L.: GosTehlzdat. 103 P.
Kleimenov, A. F. (1993). Nonantagonistic positional differential games. Yekaterinburg "Science". P. 184.
Kolabutin, N. V. (2008). Quantitative modeling of dynamically stable joint venture. Saint-Petersburg: publishing house "Dom" Saint-Petersburg State Unversity, Proceedings of 39 international science conference graduate and undergraduate students, pp. 47-51.
Kolabutin, N. V. (2015). The two-level cooperation in the differential game technology alliance. Vestnik St. Petersburg State University, cer. 10, 1, 42-63.
Kostyunin, S. Y. , Shevkopljas, E. V. (2011). On the simplification of the integral payoff in differential games with random duration. Vestnik of Saint-Petersburg State Unversity. Ser 10: Applied Mathemtics, Informatic, Control processes., 4, 47-56.
Krasovsky, N.N., Subbotin, A.I. (1974). Positional differential games. .: Science. 458 P.
Krasovsky, N.N., Kotelnoikova, A.N. (2010). On a differential interception game. Uro RAS, 16(5), 113-126.
Kozlovskaya, N. V., Petrosyan, L. A., Ilina, A. V. (2010). Coalition solution in a problem of pollution cost reduction. Vestnik St. Petersburg State University, cer. 10, 2, 46-59.
Petrosyan, L.A., Sedakov, A. A., Surin, A. N. (2006). Multistage game with coalition structure. Vestnik of Saint-Petersburg State Unversity. Ser 10: Applied Mathemtics, Informatic, Control processes, 4, 97-110.
Petrosyan, L. A., Danilov, N.N. (1982). Cooperative differential game and their applications. Tomsk: Ed Tomsk University. 273 P.
Petrosyan, L. A. (1977). Stability of solutions of differential games with many participants Vestnik Leningrad State University. cer. 1, 4(19), 46-52.
Petrosyan, L. A. (1993). Differential Games of Pursuit. World Scientific, Singapore. 326 P.
Petrosyan, L. A., Zaccour, G. (2003). Time-consistent Shapley value allocation of pollution cost reduction. Journal of economic dynamics and control, 27(3), 381-398.
Petrosyan, L. A., Zenkevich, N. A. (1996). Game Theory. World Scientific Publishing. Singapore. 352 P.
Yeung, D.W. K., Petrosyan, L. A. (2001). Proportional time-consistent solution in differential games. Yanovskaya E.B. (ed) International Conference on Logic Game Theory and Social Choice, pp. 254-256.
Yeung, D.W. K., Petrosyan, L. A. (2006). Cooperative Stochastic Differential Games. Springer. 253 P.
Yeung, D.W. K., Petrosyan, L. A. (2012). Subgame Consistent Economic Optimization. Springer. 403 P.
Zenkevich, N. A., Kolabutin, N. V. (2007). Quantitative Modeling of Dynamic Stable Joint Venture. Preprint Volume of the 11th IFAC Symposium " Computational Economics and Financial and Industrial Systems ", IFAC, Dogus University of Istanbul, Turkey, pp. 68-74.
Zenkevich, N.A., Petrosyan, L. A. (2007). The problem of time consistency of cooperative solutions in the management. Vestnik of Saint-Petersburg State Unversity. Ser 8: Management. 1, 7-42.