Subgame Consistent Cooperative Solutions in Stochastic Differential Games with Asynchronous Horizons and Uncertain Types of Players
David W. K. Yeung
Department of Business Administration,
Hong Kong Yan University, Hong Kong;
Center of Game Theory, St Petersburg University,
St Petersburg, Russia E-mail: [email protected]
Abstract This paper considers cooperative stochastic differential games in which players enter the game at different times and have diverse horizons. Moreover, the types of future players are not known with certainty. Subgame consistent cooperative solutions and analytically tractable payoff distribution mechanisms leading to the realization of these solutions are derived.
This analysis widens the application of cooperative stochastic differential game theory to problems where the players’ game horizons are asynchronous and the types of future players are uncertain. It represents the first time that subgame consistent solutions for cooperative stochastic differential games with asynchronous players’ horizons and uncertain types of future players are formulated.
Keywords: Cooperative stochastic differential games, subgame consistency, asynchronous horizons, payment distribution mechanism.
AMS Subject Classifications. Primary 91A12; Secondary 91A25.
1. Introduction
In many game situations, the players’ time horizons differ. This may arise from different life spans, different entry and exit times in different markets, and the different duration for leases and contracts. Asynchronous horizon game situations occur frequently in economic and social activities. Moreover, only the probability distribution of the types of future players may be known. In this paper, we consider cooperative stochastic differential games in which players enter the game at different times and have diverse horizons. Moreover, the types of future players are not known with certainty.
Cooperative games suggest the possibility of socially optimal and group efficient solutions to decision problems involving strategic action. In dynamic cooperative games, a stringent condition for a dynamically stable solution is required: In the solution, the optimality principle must remain optimal throughout the game, at any instant of time along the optimal state trajectory determined at the outset. This condition is known as dynamic stability or time consistency. The question of dynamic stability in differential games has been rigorously explored in the past three decades. (see Haurie (1976), Petrosyan and Danilov (1982) and Petrosyan (1997 )). In the presence of stochastic elements, a more stringent condition - that of subgame consistency - is required for a dynamically stable cooperative solution. In particular, a cooperative solution is subgame-consistent if an extension of the solution policy to
a situation with a later starting time and any feasible state brought about by prior optimal behavior would remain optimal. In particular dynamic consistency ensures that as the game proceeds players are guided by the same optimality principle at each instant of time, and hence do not possess incentives to deviate from the previously adopted optimal behavior. A rigorous framework for the study of subgame-consistent solutions in cooperative stochastic differential games was established in the work of (Yeung and Petrosyan (2004, 2005 and 2006). A generalized theorem was developed for the derivation of an analytically tractable “payoff distribution procedure” leading to subgame consistent solutions.
In this paper, subgame consistent cooperative solutions are derived for stochastic differential games with asynchronous players’ horizons and uncertain types of future players. Analytically tractable payoff distribution mechanisms which lead to the realization of these solutions are derived. This analysis extends the application of cooperative stochastic differential game theory to problems where the players’ game horizons are asynchronous and the types of future players are uncertain. The organization of the paper is as follows. Section 2 presents the game formulation and characterizes noncooperative outcomes. Dynamic cooperation among players coexisting in the same duration is examined in Section 3. Section 4 provides an analysis on payoff distribution procedures leading to subgame consistent solutions in this asynchronous horizons scenario. An illustration in cooperative resource extraction is given in Section 5. Concluding remarks and model extensions are given in Section 6.
2. Game Formulation and Noncooperative Outcome
In this section we first present an analytical framework of stochastic differential games with asynchronous players’ horizons, and characterize its noncooperative outcome.
2.1. Game Formulation
For clarity in exposition and without loss of generality, we consider a general class of stochastic differential games, in which there are u +1 overlapping cohorts or generations of players. The game begins at time ti and terminates at time tu+i. In the time interval [ti,t2), there coexist a generation 0 player whose game horizon is [ti, t2) and a generation 1 player whose game horizon is [ti,t3). In the time interval [tfc,tfc+i) for k €{2, 3, ■ ■■ ,u —1}, there coexist a generation k — 1 player whose game horizon is [tfc_i,tfc+i) and a generation k player whose game horizon is [tk,tk+2). In the last time interval [t„ ,tu+i], there coexist a generation u — 1 player and a generation u player whose game horizon is just [t„,tu+i].
For the sake of notational convenience in exposition, the player who enters the game at time tk can be of types wak€ {coi,w2, ■ ■ ■ ,w^k}. When the game starts at initial time ti, it is known that in the time interval [ti,t2), there coexist a type wi generation 0 player and a type w2 generation 1 player. At time ti, it is also known that the probability of the generation k player being type wak € {wi, w2, ■ ■ ■ , w } is Aak€ {Ai, X2, ■ ■ ■ , \ }, for k € {2, 3, ■ ■ ■ ,u}. The type of generation k player will become known with certainty at time tk.
The instantaneous payoff functions and terminal rewards of the type wak generation k player and the type wak_i generation k — 1 player coexisting in the time interval [tfc ,tk+i) are respectively:
gk i(Uk-l')[s,x(s),u<k_:11’O')Uk (s),u<k’k ’Y)uk 1 (s)] and qk i(Uk-l) [tk+i ,x(tk+i)],
and gk(uk [s,x(s),uikk-1’O)uk (s),u[Uk,Y)uk-1 (s)] and qk(^k) [tk+2,x(tk+2)], k i k (2.1) for k €{1, 2, 3, ■ ■ ■ , u},
where u'kk-1'O^k (s) is the vector of controls of the type wak_i generation k — 1 player when he is in his last (old) life stage while the type wak generation k player is coexisting;
and ukk'Y')Uk-1 (s) is that of the type wak generation k player when he is in his first (young) life stage while the type wak_ 1 generation k — 1 player is coexisting.
Note that the superindex “O” in u£lk-1'O'^Uk (s) denote Old and the superindex
“Y” in ukk,Y')Uk-1 (s) denote young. The state dynamics of the game is characterized by the vector-valued stochastic differential equations:
= f[sTx{s)/^k-i 1'°')UJk(s)/u!lk'Y')UJk 1(s)]ds+cr[s, x(s)]cfc(s),x(ti) = x0 G X, s (2.2) for s € [tk,tk+i),
if the type wak generation k player and the type wak_ 1 generation ak_i player coexisting in the time interval [tk ,tk+i) for k € {1,2,3, ■ ■ ■ ,u}, and where
<r[s, x(s)] is a n x O matrix and z(s) is a 0-dimensional Wiener process. Let ^[s,x(s)] = <r[s, x(s)]<r[s, x(s)]’ denote the covariance matrix with its element in row h and column Z denoted by QhZ[s,x(s)].
In the game interval [tk ,tk+i) for k €{1, 2, 3, ■ ■ ■ ,u — 1} with type wk_ i generation k — 1 player and type wk generation k player is of, the type wk_ i generation k — 1 player seeks to maximize the expected payoff:
f n tk+1
E | J gk_i(uk-1)[s, x(s), uk_k-1’O)uk (s), u{kk’Y )uk-1 (s)] e_r(s_tk) ds
+e_r(tk+1_tk)qk_i^k-1) [tk+i,x(tk+i)]
Ktk) = x € X
(2.3)
and the type wk generation k player seeks to maximize the expected payoff:
E{ I ^ gk(cJk) [s,x(s),uk‘_k-1 (s),u{kk ,Y )Uk-1 (s)]e_r(s_tk )ds
tk k-i k
+ E Aak+Jtk2 gk(Uk) [s,x(s),
k + 1 , ,
=i J tk+1
(uk’O)ua
k
(s),uk+Y)uk (s)]e_r(s_tk)ds
+e_r(tk+2_tk)qk(uk )[tk+2,x(tk+2)]
x(tk) = x € X
(2.4)
subject to stochastic dynamics
dx(s) _ (uh-1,O)uh/ (uh,y)uh-1 .
ds
f [s,x(s),wh-i 11 h (s),uhh)Wh 1 (s)]ds + <r[s,x(s)]dz (s),x(tk )= x,
for s € [th, th+1) and h € {k, k + 1, • • • , u}, where r is the discount rate.
In the last time interval [tv ,tU+1] when the generation u — 1 player is of type wv_ i and the generation u player is of type uv, the type 1 generation u — 1 player seeks to maximize the expected payoff:
f O tu + 1
E^ J gv—1(cj‘v-1) [s, x(s), uU---1 ’O)Uu (s),uU>u’Y )Uu-1 (s)] e-r(s-tu )ds
z(tv) = x € X > , (2.5)
+e-r(tu+1-tu)qU-1(“*-1) [tu+1,x(tu+1)] and the type uv generation u player seeks to maximize the expected payoff:
E
gu(uu) [s, x(s), uU?u-1'O)Uu (s), uUifu,Y^u-1 (s)] e-r(s-tu)ds
+ e-r(tu+1-tu )[tu + 1,x(tu + 1)]
subject to the stochastic dynamics
x(tv) = x € X
(2.6)
= f[s, x(s), u£>v,Y'>u>'> 1 (s)\ds + <t[s, x(s)]dz(s), x(tv) = x,
for s € [tu,tv+{].
The game formulated in (2.1)-(2.6) is an extension the Yeung (2011) analysis to a game with stochastic dynamics. It has the characteristics of the finite overlapping generations version of Jrgensen and Yeung’s (2005) infinite generations game.
2.2. Noncooperative Outcomes
To obtain a characterization of a noncooperative solution to the asynchronous horizons game in Section 2.1 we first consider the solutions of the games in the last time interval [tu,tU+1], that is the game (2.5)-(2.6). One way to characterize and derive a feedback solution to the games in [tu ,tU+1] is to invoke the conventional approach in solving a standard stochastic differential game and obtain:
Lemma 2.1. If the generation u — 1 player is of type wv_ 1 € {lu^w^, • • • ,uq 1} and the generation u player is of type uv€ {w^w^^ • • ,u^^} in the time interval [tu ,tv+1], a set of feedback strategies {^U’-i1'0^^ (t,x);4>Ufu ,Y )Uu-1 (t,x)} constitutes a Nash equilibrium solution for the game (5)-(6), if there exist twice continuously differentiable functions Vv-1(Uu-1'0)Uu (t,x) : [tu ,tv+1] x Rm ^ R and
Vv(Uu,Y)Uu-1 (t, x) : [tu, t+1] x Rm^ R satisfying the following partial differential
equations:
h,Z=1
= max { gu-1(Uu-1')\t, x, uv-1, 4>vu’Y')Uu-1 (t, x)]e-r(t-tu')
Uu-1 I
+Vr1(Uu-1,O)Uu (t,x) f [t,x,uv-i, 4>vu'Y)Uu-1 (t, x)] } , Vu-1^v-i,o)^v (tv+1,x) =e-r(tu+i-tu)qv-1(uu-i)(tv+1,x), and
.. m
h,Z=1
= ma^ gv(Uu) [t, x, 4^-1 1 ’O^'u (t, x), uv]e-r(t-tu)
Uu I
+v^u’Y)“u-(t,x) f [t,x,4>i-:-1’O)Uu(t,x),u]},
Vu (^u’Y )“u-1 (tv+1,x) = e-r(tu+1-tu )qu (uu) [tv+1,x(tv+1)]. (2.7)
Proof. Follow the proof of Theorem 6.27 in Chapter 6 of Basar and Olsder (1999).
□
For ease of exposition and sidestepping the issue of multiple equilibria, the analysis focuses on solvable games in which a particular noncooperative Nash equilibrium is chosen by the players in the entire subgame.
We proceed to examine the game in the second last interval [tv-1,tv). If the generation u — 2 player is of type ww_2 € {w1, w2, • • • , } and the generation u — 1
player is of type ww_-€ {w1,w2, • • • ,wq}. The type ww_2 generation u — 2 player seeks to maximize:
£ ( /t u gv-2(uu-2)[s, x(s), u{—2’O)uu-1 (s), U—1^u-2 (s)]e-r(s-tu-1)ds
t 1
+e-r(tu-tu-1)qv-2(u u-2)[tu ,x(tv)]
x(tv-1) = x € X \. (2.8)
As shown in J0rgensen and Yeung (2005) the terminal condition of the type wv_ 1 generation u — 1 player in the game interval [tv-1,tv) can be expressed as:
Zu
J2KVv-1(uu-1 ’O)^a (tv ,x). (2.9)
a=1
Therefore the type wv_ 1 generation u — 1 player then seeks to maximize:
e[ gv-l(u—)ls,x(s),u{—2'O)^-1 (s),u(uUJ--1’Y )Uu-2 (s)]e-r(s-tu-1) ds
I J tv-1
+e-r(tu-tu-l) £ KVu-1(uu-i’O)u* (tu,x(tu)) x(tu-x) = x e x |
a—1 '
Similarly, the terminal condition of the type wk generation k player in the game interval [tk,tk+1) can be expressed as:
Sk + l
Y^XaVk(uk ,O^a (tk+1 ,x), for k e{1, 2,- ■ ■ ,u — 3}. (2.10)
a—1
Consider the game in the time interval [tk, tk+1) involving the type wk generation k player and the type wk_ 1 generation k — 1 player, for k e{1, 2, ■ ■ ■ ,u — 3}. The type u>k_ i generation k — 1 player will maximize the payoff
E { J^1 gk-1(uk-i\[s, x(s), utkrOWk (s), u(“k ’Y )uk-1 (s)] e-r(s-tk) ds
x(tk) = x e X \ , (2.11)
+e-r(tk+1-tk)qk-1(^-1 )[tk+i,x(tk + i)]
and the type wk generation k player will maximize the expected payoff:
e{ gk(uk) [s,x(s),uk-T ’O)Uk (s),utk Y )uk-1 (s)] e-r(s-tk) ds
Sk+1 'j
+e-r(tk+1-tk)Y^/ \aVk(uk°’)u« (tk+1,x) x(tk )= x e x I
a— 1 '
(2.12)
subject to (2.2) with x(tk) = x.
A Nash equilibrium solution to the game (2.11)-(2.12) can be characterized as:
Lemma 2.2. A set of feedback strategies {rf>k—k-1’O')U'k (t, x); 4,<k>k’Y ')Uk-1 (t, x)} con-
stitutes a Nash equilibrium solution for the game (2.11)-(2.12), if there exist continuously differentiable functions V k-1(Uk-1’O)Uk (t,x) : [tk ,tk+1] x Rm ^ R and
Vk(Uk’Y')Uk-1 (t,x) : [tk,tk+1] x Rm^ R satisfying the following partial differential
equations:
_vk-i ^ Q^^x)VXTk ^x)
h’Z—1
max | gk 1(Uk-1')[t,x,uk-1,4>'kk’Y')Uk 1 (t,x)]e r(t tk)
uk-1 v
+V—Uk-1’O)Uk f [t,x,uk-1,4>kk'YW-1 (t,x)] } ,
Vk-1(^k-1 ’O)^k (tk+1,x) = e-r(tk+1-tk)qk-1(uk-1) (tk+1,x), and
.. m
h’Z—1
= max j g(k’Uk^[t,x,4>kk^k:-1 ,O)Uk (t,x),uk }e-r(t-tk)
uk v
^M^k ’Y)uk-1 rr. A^k-1’O)uku ^ l 1
+ Vx f [t,x,0k-1 (t,x),uk]j,
Sk+1
Vk(^k’YW-1 (tk+1, x) = e-r(tk+1-tk)Yj AaVk(uk’O’^a (tk+1,x),
a—1
for k e {1, 2,- ■ ■ ,u — 1}. (2.13)
Proof. Again follow the proof of Theorem 6.16 in Chapter 6 of Basar and Olsder (1999). □
A theorem characterizing the noncooperative outcomes of the game (2.2)-(2.6) can be obtained as:
Theorem 2.1. A set of feedback strategies {p>k—k-1’O')U'k (t,x);pk’k Y ')Uk-1 (t,x)} constitutes a Nash equilibrium solution for the game (2.2)-(2.6), if there exist continuously differentiable functions Vk-1(^k-1’O^Uk (t,x) :[tk ,tk+1] x Rm ^ R and Vk(Uk’Y)Uk-1 (t,x) :[tk,tk+1] x Rm^ R satisfying the following partial differential equations:
.. m
h’Z—1
= ma^ gu-1(Uu-1^[t, x, uu-1, ’Y ')Uu-1 (t, x)]e-r(t-tu')
Uu-1 I
+Vr1(Uu-1’O)Uu (t,x) f [t,x,uu-1, 'Y)Uu-1 (t, x)] } ,
Vu-1(^v-1’O)^ (tu+1,x) = e-r(tu+1-tu)qu-1(uv-1 )(tu+1,x), and
m
h’Z—1
= maJ gu(uu) [t, x, p"--1 ’O)"u (t, x), u]e-r(t-t^)
Uu I
+Vu^’Y )“u-1 (t,x) f [t,x,pu--1’O)Uu (t,x),uu ] } ,
Vu(“u’Y^u-1 (tu+1,x) = e-r(tu+1-tu)qu(“v) [tu+1,x(tu+1)]; (2.14)
h,Z=1
= max { gk-1(uk-l)[t, x, uk-U $k’Y]^k-1 (t, x)]e-r(t-tk)
Uk-1 (
, T/k-1(uk-1,°)uk rr. A^k ’Y )uk-1(,
+ Vx f [t,X,Uk-1,pk (t,x)] > ,
Vk-1(uk-i,°)“k (tk+1,x) = e-r(tk+1-tk)qk-1(uk-i\tk+1,x), and
-V?<‘Uk'Y)Uk-‘(M)
max
Uk
h,Z=1
{ g(k’uk)[t, x, 4>tV’°)Uk (t, x), uk]e-r(t-tk)
+VkUk,Y)Uk-1 f [t,x,tk-k-1’°)Uk (t,x),Uk] } ,
Sk + 1
Vk^k,Y^k-1 (tk+1,x) = e-r(tk+1-tk) AaVk(uk,°>* (tk+1 ,x),
a=1
for k G{1, 2,- • • ,u — 1}. (2.15)
Proof. The results (2.14) follows from Lemma 1 and those in (2.15) follows from Lemma 2.2. □
Using Theorem 2.1 one can obtain a non-cooperative game equilibrium of the game (2.2)-(2.6).
3. Dynamic Cooperation among Coexisting Players
Now consider the case when coexisting players want to cooperate and agree to act and allocate the cooperative payoff according to a set of agreed upon optimality principles. The agreement on how to act cooperatively and allocate cooperative payoff constitutes the solution optimality principle of a cooperative scheme. In particular, the solution optimality principle for the cooperative game includes (i) an agreement on a set of cooperative strategies/controls, and (ii) an imputation of their payoffs.
Consider the game in the time interval [tk, tk+1) involving the type wk generation
k player and the type wk_ 1 generation k — 1 player. Let w^>k-1'u>k) denote the probability that the type wk generation k player and the type wk_ 1 generation k — 1 player would agree to the solution imputation
[^k-1(^k-1 ,°)uk[£] (t,x),£k(Uk,Y)Uk-1[e\t,x)] over the time interval [tk,tk+1), ‘'(u’k-1,u’k') (^ ^ ) where wi k-1’ k = 1.
h=1
At time t1, the agreed-upon imputation for the type w1 generation 0 player and the type w2 generation 1 player are known.
The solution imputation may be governed by many specific principles. For instance, the players may agree to maximize the sum of their expected payoffs and equally divide the excess of the cooperative payoff over the noncooperative payoff. As another example, the solution imputation may be an allocation principle in which the players allocate the total joint payoff according to the relative sizes of the players’ noncooperative payoffs. Finally, it is also possible that the players refuse to cooperate. In that case, the imputation vector becomes [Vk-1(^k-1,O)Uk (t,x),
Vk(Uk’Y)Uk-i (t,x)].
Both group optimality and individual rationality are required in a cooperative plan. Group optimality requires the players to seek a set of cooperative strategies/controls that yields a Pareto optimal solution. The allocation principle has to satisfy individual rationality in the sense that neither player would be no worse off than before under cooperation.
3.1. Group Optimality
Since payoffs are transferable, group optimality requires the players coexisting in the same time interval to maximize their expected joint payoff. Consider the last time interval [t^,tu+1\, in which the generation u — 1 player is of type 1 G {w1, w2, • • • , } and the generation u player is of type G [cj1, w2, • • • , }. The
players maximize their expected joint payoff:
e{ J”+1 ^gu-1(-u-i)[s,x(s),uiUJ_u^1’O)Uu (s),uUfu ,Y ^-1 (s)\
+gu (u* )[s,x(s),u[--1 ’O)Uu (s),uU?u ,Y )Uu-1 (s)\) e-r(s-tu )ds +e-r(tu+1-tu) {qv-1(^-i)[tv+1,x(tv+1)\ + )[tv+1,x(tv+1 )\) x(tv) = x G X J
(3.1)
subject to (2.2) with x(tv) = x.
Invoking the technique of stochastic dynamic programming an optimal solution of the problem (3.1)-(2.2) can be characterized as:
Lemma 3.1. A set of Controls {'^A-1,O^Uu(t, x); ,Y)Uu-1 (t, x)} constitutes an optimal solution for the stochastic control problem (3.1)-(2.2), if there exist continuously differentiable functions W[tu,tu+1\(Uu-1,Uu)(t,x) : [t„,tv+1\ x Rm ^ R satisfying the following partial differentia,l equations:
.. m
- wf^+l](^-1,t0(t,x) -- Y, nH(t,x)Wl££’+lKU',~1’U',)(t>x)
h,Z=1
v- 1(u„,_ „. i -r(
max <J gv-1(uu-i)[t,x,uv_ 1,uu\ e-r(t-tu)
uu— 1 ^
iu(“j[t x _ u \e-r(t-tu) + W ^ ’tu+1 }(uv-1,uv) (
+ gu(^[t, x, uv-1, uv\e-r(t-tu) + WX , + ,(Wu-1’Wu)(t, x)f [t, x, uv-1, uv\
Wltu,tu+1](^u-1,^u) (tv+1,x) = e-r(tu+1-tu)[qu-1(uu-1)(tv+1,x) + qv(uu](tv+1,x)\.
(3.2)
Proof. The results in (3.2) are the characterization of optimal solution to the stochastic control problem (3.1)-(2.2) according to stochastic dynamic programming. □
We proceed to examine joint payoff maximization problem in the time interval [tv-1,tv) involving the type wv_! generation v — 1 player and type 2 generation v — 2 player. A critical problem is to determent the expected terminal valuation to the wv_! generation v — 1 player at time tv in the optimization problem within the time interval [tv-1,tv). By time tv, the wv_1 generation v — 1 player may co-exist with the wvG {w1,w2, ■ ■ ■ ,wq } generation v player with probabilities {A1; X2, ■ ■ ■ , X‘}. Consider the case in the time interval [t„, tv+1) in which the type wv_i generation v — 1 player and the type wv generation v player co-exist. The probability that the type wv_ 1 generation player and the type wv generation player would agree to the solution imputation
[^-^u-lOK ih](t,x),^v ,Y)^u-1lh] (t,x)] is d£u-1'Uu)
where w^u-1'UuU) = 1. (3.3)
In the optimization problem within the time interval [tv-1,tv), the expected terminal reward to the u>v_ 1 generation v — 1 player at time tv can be expressed as:
u-1 'Ma)
a)
E E ^h"-1 ) C-1^-!’0^^] (tv , x) . (3.4)
a=1 h=1
Similarly for the optimization problem within the time interval [tk,tk+1), the expected terminal reward to the wk generation k player at time tk+1 can be expressed
‘k+1 ‘(^k'^a)
E E ^tk'Ua)iki"h’0)"aih](tk+1 ,x), for k G{1, 2, ■ ■ ■ ,H — 3}. (3.5)
a=1 h=1
The joint maximization problem in the time interval [tk ,tk+1), for k G {1, 2, ■ ■ ■ ,v — 3}, involving the type wk generation k player and type wk_ 1 generation k — 1 player can be expressed as:
max E$ f f gk 1(Uk-l)[s,x(s),uk-;-L1,0)uk (s),u{kk ’Y)uk 1 (s)]
uk-i,uk [ J tk \
+ gk(Kk) [s,x(s),u{k-;-1’0)Uk (s),utk’Y)uk-1 (s)] ) e-r(s-tk)ds + e-r(tk+1-tk) ( qk-1(Kk-1) [tk+1, x(tk+1)]
‘k + 1 ‘’(^k'^al
+ E E ^ ’Ua) ^ 0)U* ih](tk+1 ,x(tk + 1)))
(tk )= x G X h (3.6)
subject to (2.2) with x(tk) = x.
The conditions characterizing an optimal solution of the problem (3.6)-(2.2) are given as follows.
Theorem 3.1. A set of Controls {4k-i1'0°Uk (t,x);^k^k,Y')Uk-1 (t,x)} constitutes an optimal solution for the stochastic control problem (3.6)-(2.2), if there exist continuously differentiable functions W [tk,tk+1'](Uk-1’Uk) (t, x) : [tk ,tk+1) x Rm ^ R satisfying the following partial differential equations:
.. m
-Wf^+l](^-1,t0(t,x) - 2 E nH(t>x)Wx*£’+lKU',~1'U,,)(t’x)
h,Z=1
max gu 1(Uu-1^[t, x,uv_ 1,uu] e r(t tu)
uu— 1 ,uu
+gv(uu)[t, x, uv_ 1, u ]e-r(t-tu) + wXtu ’tu+1^(Uu-1’Uu) (t, x)f [t, x, uv_ 1, u ]
W [tu tu+^u-i’Uu) (tv+1,x) = e-r(tu+1-tu )[qu-1^v-i) (t,J+1 ,x) + qv(u"] (tv+1, x)];
for k G {1, 2, ■ ■ ■ ,v — 1}:
_Wt[tfc’tfc+1](“fc-1’a'fc)(i,x) - - J2 &C(t,x)W^£k+lKUk-1,Uk\t,x)
h,Z=1
= max { gk-1(Uk-1^[t,x,uk_ 1,uk]e-r(t-tk) uk— 1 ,uk
+gkK)[t, x, uk-1, uk]e-r(t-tk) + witk’tk+1KUk-1’Uk\t, x) f [t, x, uk-1, uk] } ,
W[tk,tk+1l(Kk-1,^k)(tk+1,x) = e-r(tk+1-tk) ( qk-1(uk-1)(tk+1 ‘k+1 ‘(“k'“a)
+ E E ^k ’U°) ^ ’0)Ua[h](tk+1 ,x)) . (3.7)
a=1 h=1
Proof. Invoking the standard technique of stochastic dynamic programming we obtain the conditions characterizing an optimal solution of the problem (3.6)-( . ) as in (3.7). □
Substituting the set of cooperative strategies into ( . ) yields the dynamics of the cooperative state trajectory in the time interval [tk,tk+1)
~j~ = f[six(s)i^-i1’0)Uk(six(s))^<k’k’Y)Uk~1(six(s))\ + a[six(s)\dz(s)i (3-8)
for s G [tk, tk+i), k G{1, 2, ■ ■ ■ ,v] and x(ti) = x0.
We denote the set of realizable states at time t from (3.8) under the scenarios of different players by x{tk,tk+1^(lJk,lJk+1)*, for t G [tk,tk+1) and k G {1, 2, ■ ■ ■ ,v}. We use the term x{tk,tk+1^(^k,^k+1> Gx{tk'tk+l]{uk'Uk+l)* to denote an element in xjtk,tk+i]{^k^k+1)*. The term x* is used to denote x{tk,tk+1^(uk’Uk+1')* whenever there is no ambiguity
To fulfill group optimality, the imputation vectors have to satisfy:
£k-1(^k-1,o^k №(t,x*) + ik(uk Y )uk-1[e\(t,x*t) = W[tk ’tk+1^k-1,^k) (t, x*), (3.9)
for t G [tk,tk + 1), WkG {^i,^2^ ■ ■ ,Wsk ] Wk-1G {w1,w2^ ■ ■ ,W<;k-1 ]
t G{1, 2, ■ ■ ■ , S(Uk-1,Uk)} and k G{0,1, 2, ■ ■ ■ , v], where x* is the short form for xfk-1'Uk')*.
3.2. Individual Rationality
In a dynamic framework, individual rationality requires that the imputation received by a player has to be no less than his noncooperative payoff throughout the time interval in concern. Hence for individual rationality to hold along the cooperative trajectory { x(Uk-1,Uk')*(t)} ,
£k-1^k-1 ,o)^k[e] (t, x*) > Vk-1^k-1,o^k (t,x*) and
^k,y)^k-1[e] (t,x*) > Vk(uk Y)^k-1 (t,x*), (3.10)
for t G [tk ,tk + 1), Wk G {w1,w2^ ■ ■ ,W<;k ] Wk-1 G {wi,w2r ■ ■ ,W<;k-1 ] t G{1, 2, ■ ■ ■ , S(uk-1,uk)} and k G{0,1, 2, ■ ■ ■ ,v}.
For instance, an imputation vector equally dividing the excess of the cooperative payoff over the noncooperative payoff can be expressed as:
£k-1^k-1,o)^k\?](t, x*) = Vk-1(uk-10)^k (t, x*) + 0.5[W[tk,tk+1K^k-1,^k)(t, x*)
- vk-1(uk-1,o^k (t,x*) - V k(uk,Y)»k-1 (t,x*)], and
£k{^k ,Y W-1[£](t,x*) = V k(uk,Y )^k-1 (t,x*) +0.5[W [tk,tk+1](^k-1,^k)(t,x*)
- vk-1(^k-1,0)^k (t, x*) - Vk(uk,yfrk-1 (t, x*)]. (3.11)
One can readily see that the imputations in (3.11) satisfy individual rationality and group optimality.
4. Subgame Consistent Solutions and Payoff Distribution
A stringent requirement for solutions of cooperative stochastic differential games to be dynamically stable is the property of subgame consistency. Under subgame consistency, an extension of the solution policy to a situation with a later starting time and any feasible state brought about by prior optimal behaviors would remain optimal. In particular, when the game proceeds, at each instant of time the players are guided by the same optimality principles, and hence do not have any ground for deviation from the previously adopted optimal behavior throughout the game.
According to the solution optimality principle the players agree to share their cooperative payoff according to the imputations
,0)uk [£] x * ),£k(^k,Y )uk-1[e] (t, x * )]
(4.1)
over the time interval [tk,tk+1).
To achieve dynamic consistency, a payment scheme has to be derived so that imputation (4.1) will be maintained throughout the time interval [tk,tk+1). Following Yeung and Petrosyan (2004 and 2006) and Yeung (2011), we formulate a payoff distribution procedure (PDP) over time so that the agreed imputations (4.1) can be
(s) and Bkk’ ^k-1 (s) denote the instantaneous pay-
realized. Let B{Uk-1’0)Uk[£]s v(^k’Y
Jk-1
ments at time s G [tk,tk+1) allocated to the type wk_ 1 generation k — 1 (old) player and type wk generation k (young) player.
In particular, the imputation vector can be expressed as:
£k-1(^k-i’0)^k[z\(t,x*) = B<jk--1-1’0)Uk[ei(s) e-r(s-tk)d
+ e-r(tk+1-tk)qk-1(^k-1)[fk+1,x* (tk+1)]
tk + 1 i
x(tk) = x * G X
£k(^’YW-1[£](f,x*) = e| £
+ E E
B
A^k ’Ua) fk(Uk’0)Ua [£]
^k’0)^« [*](tk + 1,x * (tk+1))
a=1 £=1
](s) e-r(s-tk)ds
:(tk)= x* G X , (4.2)
for k G {1, 2, • • • ,u — 1}, and
^-1(^v-1’0)^v[e](t,x*t ) = e| £
B^-1’0^”M(s) e-r(s-tu)ds
+ e-r(tu+1-tu )qu-1(^u-1) [tv+1,x* (tv+1)]
x(tv) = x* G X
C(uv ’Y )uu-1[e](t,x*) = E
B^’Y^u-M(s) e-r(s-tu)ds
+ e
-r(tu+1-tu) nv(uu )
qv(Uu) [tu+1,x * (tu+1)]
C(tu) = x* G X \ ■ (4.3)
Using the analysis in Yeung and Petrosyan (2006) and Petrosyan and Yeung (2007) we obtain:
u+1
t
v + 1
Theorem 4.1. If the imputation vector [£k-1(“k-1 ’0)^k M(t, x* ),^k(^k ’0)Uk-1№(t, x*)] are functions that are continuously differentiable in t and x*, a PDP with an instantaneous payment at time t G [tk,tk+1):
(uk-i,O)ukj-k~1(uk-i,O)ukIp] ,
Bk-1 (t) = S t (t, xt )
m
h,Z=1
- St 1(--k-l’O^e\t, x*t )f [t, x *t ,^tkrO)Uk (t, x*t), ^k ,Y )Uk-1 (t, x*t)] (4.4)
allocated to the type &k~ 1 generation k — 1 player; and an instantaneous payment at time t G [tk,tk+i):
h,Z=1
—sTk ,Y )Uk-iW(t,xt )f [t,xt ,4-T,O)Uk (t,xt ),4Uk ,Y )Uk-1 (t,xt)]
allocated to the type &k generation k player,
yields a mechanism leading to the realization of the imputation vector
[£k-1(wk-1 ,O)^k W (t, xt ),Sk(Uk ,Y)uk-1[£] (t, xt )], for £ G{1, 2, • • • ,^(Uk-i,Uk)} and k G{1,2, • • • ,v}.
Proof. Follow the proof leading to Theorem 4.4.1 in Yeung and Petrosyan (2006) with the imputation vector in present value (rather than in current value). □
5. An Illustration in Resource Extraction
Consider the game in which there are 4 overlapping generations of players with generation 0 and generation 1 players in [t1,t2), generation 1 and generation 2 players in [t2, t3), generation 2 and generation 3 players in [t3,t4]. Players are of either type 1 or type 2. The instantaneous payoffs and terminal rewards of the type
1 generation k player and the type 2 generation k player are respectively:
(u
£)1/2 - ~Yj^uk and (?!X1/2; and (uk)l/2 - and q2x1/2.
x x (5.1)
At initial time ti, it is known that the generation 0 player is of type 1 and the generation 1 player is of type 2. It is also known that the generation 2 and generation 3 players may be of type 1 with probability A1 = 0.4 and of type 2 with probability A2 =0.6.
The state dynamics of the game is characterized by the stochastic dynamics:
—— = [ax(s)1/2 — bx(s) — uk-i(s) — uk(s)]ds + ax(s)dzts), x(t\) = xn G X C R, ds
(5.2)
for s G [tk, tk+1) and k G{1, 2, 3}.
The game is an asynchronous horizons version of the synchronous-horizon resource extraction game in Yeung and Petrosyan (2006) and an extension of the
Yeung (2011) analysis to include stochastic dynamics. The state variable x(s) is the biomass of a renewable resource. uk(s) is the harvest rate of the generation k extraction firm. The death rate of the resource is b. The rate of growth is a/x1/2 which reflects the decline in the growth rate as the biomass increases. The type i G {1, 2} generation k extraction firm’s extraction cost is ciuk(s)x(s)-1/2.
This asynchronous horizon game can be expressed as follows. In the time interval [tk,tk+i), for k G {1, 2}, consider the case with a type i G {1, 2} generation k — 1 firm and a type j G {1, 2} generation k firm, the game becomes
( rt-k+i
max E < /
uk-i I J tk
i”K“w]‘/2 -
x( s )
exp[—r(s — tk)] ds
+ exp[-r(tk+1 -tk)]qix(tk+1)2
max E
uk tk
2
tk+i
r (j,Y)i/ m1/2 cj (j,Y)i
K (g)]! - , : r"~
x( s )
,1/2
u2 (s
(s)
+ J2 xo
a=1
t3
exp[—r(s — tk)] ds exp[—r(s — tk)] ds
, \1/2 k x(s)
+ exp[-r(tfc+2-tfc)]^x(tfc+2)* } , (5.3)
subject to (5.2).
In the time interval [t3,t4], in the case with a type i G {1, 2} generation 2 firm and a type j G {1, 2} generation 3 firm, the game becomes
max E
u2
r t4
t3
\>Ul'°)j (s)]1/2 —
1/2
2
max E
u3
x(s)
+ exp [—r(t4 -t3)] %x(t4)5
t4
(s)
exp[—r(s — t3)] ds
x(ts) = x
t3
[ujO)i(s)]1/2 —
c
-3
Un
(s)
x(s)1/2 + exp [—r(t4 - t3)] qjx(U)i
exp[—r(s — t3)] ds
x(t3) = x \ , (5.4)
subject to (5.2) with x(t3) = x.
5.1. Noncooperative Outcomes
In this section we characterize the noncooperative outcome of the asynchronous horizons game (5.2)-(5.4).
Proposition 5.1. The value functions for the type i G {1, 2} generation k — 1 firm and the type j G {1, 2} generation k firm coexisting in the game interval [tk,tk+i) can be obtained as:
Vk-1(i,O)j(t, x)= exp[—r(t — tk)] A(t)x1/2 + 0^°(t) , and
Vk(j,Y)i(t, x) = exp[—r(t — tk)] A[j,Y')l(t)x1/2 + Cjkj'Y)i(t)
for k G {1, 2, 3} and i, j G{1, 2}, where
A(!,OJj(t), 0^°(t), AjY^(t) and CjY)i(t) satisfy:
(5.5)
Lk-1
A (i,O)j Ak 1
(t) =
b a2
r H------1----
2 8
A
(i,O)j
k1
(t)-
ci + Aki-O)j (t)/2
+ ■
Ci + AtO)j (t)/2
+
(t)
+
12 r -i 2 ;
8 Ci + Aki!°1)j(t)/2 8 \cj + Aj’Y)i(t)/2
Aki_!°))j(tk+1) = % and C{k°(tk+1) = 0, for k G {1, 2, 3};
(5.6)
1
c
2
2
4
A jY )i(t) =
b a2
r -1----------1--------
2 8
+ -
cj + Ak’Y )i(t)/2
+ -
cj + Akj,Y )i(t)/2
+
Aj’Y )i (t)
ci + A(k°)j (t)/2
= for ke {1,2,3};
A
(j,Y )i
(tk+1) = e-r(tk+1-tk ) ^ XeA(k’O^e (tk+1) and
Cj )i(tk+1) = e-r(tk+1-tk^ A,0kj’O)£(tk+1),
^=1
for k G {1, 2}, and A(’Y')l(t4) = qj and C^j,Y)i(t4) = 0. (5.7)
Proof. Using Lemmas 2.1 and 2.2 and the analysis in Proposition 5.1.1 in Yeung and Petrosyan (2006), one can obtain the value functions in (5.5). □
Following Yeung and Petrosyan (2006) the game equilibrium strategies can be expressed as:
(t,x)
ci + Akk^1^j (t)/2
4
2 and 4>k!'Y ')l(t,x)
4
cj + A(k’Y )i(t)/2
2 •
(5.8)
A complete characterization of the noncooperative market outcome is provided by Proposition 1 and (38).
1
c
j
2
2
4
2
2
8
8
k
x
x
5.2. Dynamic Cooperation
Now consider the case when coexisting firms want to cooperate and agree to act and allocate the cooperative payoff according to a set of agreed upon optimality principles. Let there be three acceptable imputations.
Imputation I: the firms would share the excess gain from cooperation equally with weights wl_1 = wk = 0.5.
Imputation II: the generation k — 1 firm acquires w‘2_1 = 0.6 of the excess gain from cooperation and the generation k firm acquires wk = 0.4 of the gain.
Imputation III: the generation k — 1 firm acquires w3_1 = 0.4 of the excess gain from cooperation and the generation k firm acquires wk = 0.6 of the gain.
In time interval [tk,tk+1), if both the generation k — 1 firm and the generation k firm are of type 1, the probabilities that the firms would agree to Imputations I,
II and III are respectively ^’\1’1') = 0.8, m21’1) = 0.1 and m^’1 = 0.1.
If both the generation k — 1 firm and the generation k firm are of type 2, the probabilities that the firms would agree to Imputations I, II and III are respectively (2’2) (2>2)
(2’2)
1
ro; ' ' = 0.7, m( ’) = 0.15 and m3 ’ ) = 0.15.
If the generation k — 1 firm is of type 1 and the generation k firm are of type 2, the probabilities that the firms would agree to Imputations I, II and III are respectively m^’2) = 0.15, m(1’2) = 0.75 and m^1’2) = 0.1.
If the generation k — 1 firm is of type 2 and the generation k firm are of type 1, the probabilities that the firms would agree to Imputations I, II and III are respectively m12’1) = 0.15, m22’1) = 0.1 and m^’1) = 0.75.
At initial time t1 , the type 1 generation 0 firm and the type 2 generation 1 firm are assumed to have agreed to Imputation II.
Since payoffs are transferable, group optimality requires the firms coexisting in the same time interval to maximize their joint payoff. Consider the last time interval [t3, t4], in which the generation 2 firm is of type i G{1, 2} and the generation 3 firm is of type j G {1, 2}. The firms maximize their expected joint profit
E
’ t3 -t4
+
t3
x(s) 1 x(s)
exp—r(s — t3)] ds exp—r(s — t3)] ds
+ exp —r(t4 - t3)] ^x(t4) 2 + exp —r(t4 - t3)] ^-x(t4) 2 subject to (5.2) with x(t3) = x.
c(t3) =
Proposition 5.2. The maximized joint payoff with type i G {1, 2} generation 2 firm and the type j G {1, 2} generation 3 firm coexisting in the game interval —, t4) can be obtained as:
W[t3’t4^(i’j)(t, x) = exp[—r(t — t3)] \A[t3’t4(i’j)(t)x1/2 + C[t3’t4(i’j)(t)l , (5.9)
x
where A[t3’t4^(i’j)(t) and C[t3’t4(i’j)(t) satisfy:
A [t3’ti](i’j) (t) =
b a2
r -1----------1-------
2 8
A[t3’t4](i’j) (t) —
2 [q + A[t3’t4}(i’j)(t)/2]
1
2 [cj + A[t3’t4(i’j)(t)/2] 4 [a + At^K’^t)^] 2 4 [cj + A^K^(t)/2]
A[t3’ti](i’j) (t) A[t3 ’t4](i’j) (t)
8 [ci + A^K’^t)^]2 8 C + A^K’O (t)/2 2’
A[t3’t4](i’j) (t4) = qi + j and C[t3’t4](i’j) (t4) = 0. (5.10)
Proof. Using Lemma 3.1 and the analysis in example 5.2.1 in Yeung and Petrosyan (2006), one can obtain (5.9)-(5.10). □
The solution time paths A[t3’t4](i’j)(t) and C[t3’t4^(i’j)(t) for the system of first order differential equations in (39)-(40) can be computed numerically for given values of the model parameters r,q1, q2, c1, c2, a and b.
In the game interval [t3, t4) if type i G {1, 2} generation 2 firm and the type j G {1, 2} generation 3 firm coexisting, the imputations of the firms under cooperation can be expressed as:
1
c
j
2
£2(i’0)jie] (t, x) = V‘2(i’0)j(t, x) + w£[W[t3’t4^(i’j) (t, x) — V2(i’O)j(t, x) — V3(j’Y)i(t, x)]
£3(j’Y)i[£](t, x) = v3(j’Y)i(t, x)+ wh[W[t3’t4^(i’j)(t, x) — V2(i’0)j(t, x) — V3(j’Y)i(t, x)]
for £ G {1, 2, 3}. (5.11)
Now we proceed to the second last interval [tk,tk+1) for k = 2. Consider the case in which the generation k firm is of type j G {1, 2} and the generation k — 1 firm is known to be of type i = 2. Following the analysis in (19) and (20), the expected terminal reward to the type j generation k firm at time tk+1 can be expressed as:
^ X£ ^ (tk+1, x), for k = 2. (5.12)
i=1 h=1
A review of Proposition 5.1, Proposition 5.2 and (5.11) shows the term in (5.12) can be written as:
Ak}j’0)x1/2 + Cz(j’0), where Ak(j’0') and CZj'° are constant terms.
(5.13)
The joint maximization problem in the time interval [tk ,tk+1), for k G {1, 2}, involving the type j generation k player and type i generation k — 1 player can be expressed as:
max E • uk-1 ’uk [ J tk
* tk+1
x(s) 1
+
[“"■“'•Ml1'1 -
x(s) '
exp[—r(s — tk)] ds exp[—r(s — tk)] ds
+ exp [—r(tk+1 — tk )] subject to (5.2).
<lix(tk+1) = + Ali3'0)x(tk+1)1/2 + cckb’0)
x(tk) = x
Proposition 5.3. The maximized expected joint payoff with type i G {1, 2} generation k — 1 firm and the type j G {1, 2} generation k firm coexisting in the game interval [tk,tk+1), for k G {1, 2}, can be obtained as:
W[tk’tk+i](i’j)(t, x) = exp[—r(t — tk)] A[tk’tk+^(i’j)(t)x1/2 + C[tk’tk+^(i’j)(t)
where A[tk’tk+^(i’j)(t) and C[tk’tk+^(i’j)(t) satisfy:
(5.14)
A [tk’tk+i](i’j) (t)
b a2
r H------1-----
2 8
A[tk’tk + l](i’j) (t) —
2 [ci + A[tk’tk+i](i’j)(t)/2]
1
ci
■H—:--------:------
2 [cj + A[tk’tk+l](i’j)(t)/2] 4 c + A[tk’tk+i](i’j)(t)/2 2 4 [cj + A[tk’tk+i](i’j)(t)/2]2
A[tk ’tk + l](i’j) (t) A[tk’tk + 1^(i’j) (t)
+ —---------;------~ ^^ +
8 [c,, + A[tk’tk+i](i’j)(t)/2 8 [j + A[tk’tk+i](i’j)(t)/2]
(j[tk ,tk + l] (i,j) = y*(^][tk ,tk+l] (i,j) J^[tk ,tk + l] (i,j)
A[tk ’tk+i](i’j) (tk+1) = qi + Akij’0) and C[tk’tk+^(i’j) (tk+1) = CCk(j’0). (5.15)
Proof. Using Theorem 3.1 and the analysis in example 5.2.1 in Yeung and Petrosyan (2006), one can obtain the results in (5.14) and (5.15). □
The solution time paths A[tk’tk+1^(i’j)(t) and Ctk’tk+1^(i’j)(t) for the system of first order differential equations in (44)-(45) can be computed numerically for given values of the model parameters r, q1, q2, c1, c2, a, b, X1, X2, and mhj’i) for h G {1, 2, 3} and j,£ G {1, 2}.
Following Yeung and Petrosyan (2006) the optimal cooperative controls can then be obtained as:
1
c
j
'lPk-°ij^TX) = -----------------“-----7“-------2’ an<^
4 [q + A[tk’tk+i\(i’j)(t)/2]
x) = —---------------------—-------72- (5.16)
4 [cj + A[tk’tk+^(i’j)(t)/2]
Substituting these control strategies into (5.2) yields the dynamics of the state trajectory under cooperation. The optimal cooperative state trajectory in the time interval [tk,tk+1)can be obtained as:
ds V 4 [ci + A[tk’tk+i\(i’j)(s)/2]
nr \
2 ds + ax(s)dz (s), x(t{) = x0, (5.17)
4 [cj + A[tk’tk+i\(i’j')(s)/2~\ ‘ for s G [tk, tk+1) and k G {1, 2, 3}.
We denote the set of realizable states at time t from (5.17) under the scenarios of different players by X{tk’tk+i^(i’j')*, for t G [tk,tk+1) and k G {1, 2, 3}. We use the term x{tk’tk+i](i’j)* GXltk’tk+i](i’j)* to denote an element in x{tk’tk+i](i’j)*. The term x* is used to denote xJ[ttk’tk+i(l’j')* whenever there is no ambiguity.
5.3. Subgame Consistent Payoff Distribution
According to the solution optimality principle the players agree to share their cooperative payoff according to the solution imputations:
£k-1(i’0)j[i] (t, x) = vk-1(i’0)j (t, x) + wh-1 [W[tk’tk+^(i’j)(t, x)
—V k-1(i’0)j (t,x) — V k(j’Y )i(t,x)],
x
^k(j’Y W](t,x) = V k(j’Y )i(t, x) + wh [W [tk’tk+i](i’j) (t,x)
—v k-1(i’0)j (t,x) — V k(j’Y )i(t,x)], for £ G {1, 2, 3}, i,j G {1, 2} and k G {1, 2, 3}.
These imputations are continuous differentiable in x and t. If an imputation vector [gk-1(i’0)j[£l(t, x),£k(j’Y)i[i (t,x)] is chosen, a crucial process is to derive a payoff distribution procedure (PDP) so that this imputation could be realized for t G [tk,tk+1) along the cooperative trajectory {x* }t1=+lk.
Following Theorem 4.1, a PDP leading to the realization of the imputation vector [£k-1(i’0)j[i (t,x),£k(j’Y)i[i](t, x)] can be obtained as:
Corollary 5.1. A PDP with an instantaneous payment at time t G [tk,tk+1):
B
(i’0)j[i]
k-1
(t) = —£
k-1(i’0)j[i]
k-1(i’0)j[i]
ZhXZ
(t, x *)
h’Z=1
_ck-1(i’0)j[i]
—x
(t, xt )
i(x *)1/2 — bx *
4 [c* + v4[tfc’tfc+1K*.j)(t)/2]2 4 [cj + J4[tfc’tfc+1K*.j)(t)/2]2
(5.18)
allocated to the type i generation k — 1 player; and an instantaneous payment at time t G [tk,tk+1):
x
x
B
2
h’Z=1
£k(j’Y )i[i\
(t, xt )
l(x*)1/2 — bx*
k
4 [ci + A[^-^+i](i.j)(t)/2]2 4 [CJ + A[^-^+i](i-J)(t)/2]2
(5.19)
allocated to the type j generation k player,
yields a mechanism leading to the realization of the imputation vector [£k-1(i’0)m (t,x),ek(j’Y )i[i] (t,x)], for k G {1, 2, 3}, £ G {1, 2, 3} and i,j G {1, 2}.
Since the imputations £k-1(i’0)j[i (t,x) and £k(j’Y)i[i (t,x) are in terms of explicit differentiable functions, the relevant derivatives in Corollary 5.1 can be derived using the results in Propositions 5.1, 5.2 and 5.3. Hence, the PDP (t)and
B>kj'Y^^(t) in (5.18) and (5.19) can be obtained explicitly.
x
x
6. Concluding Remarks and Extensions
This paper considers cooperative stochastic differential games in which players enter the game at different times and have diverse horizons. Moreover, the types of future players are not known with certainty. Subgame consistent cooperative solutions and analytically tractable payoff distribution mechanisms leading to the realization of these solutions are derived. The analysis extends the Yeung (2011) analysis with the incorporation of stochastic dynamics.
The asynchronous horizons game presented can be extended in a couple of directions. First, more complicated stochastic processes can be adopted in the analysis. For instance, the random variable governing the types of future players can be a series of non-identical random variables G Wk,^,- ■ ■ } with probabilities
Xkk G{Xk,Xk,. ■ ■ ,Xk }, for k g{2, 3,- • • ,u}.
Secondly, the overlapping generations of players can be extended to more complex structures. The game horizon of the players can include more than two time intervals and be different across players. The number of players in each time interval can also be more than two and be different across intervals. The analysis can be formulated as a general class of stochastic differential games with asynchronous horizons structure. In particular, the type wak generation k player’s game horizon
is [tk,tk+Vk), where nk > 1. The term u<^k’S1')(s) is used to denote the vector of controls of the type wak generation k player in his first game interval [tk ,tk+1); and
ukk’S2)(s) is that in his second game interval [tk+1, tk+2) and so on. This results in a general class of stochastic differential games with asynchronous horizons structure. Theorem 3.1 and Theorem 4.1 can be readily extended to this general structure with more than two players in each time interval.
Finally, this is the first time that subgame consistent cooperative solutions are analyzed and derived in stochastic differential games with asynchronous players’ horizons, further research along this line is expected.
Acknowledgement
Financial support by the HKSYU is gratefully acknowledged.
References
Basar, T. and G. J. Olsder (1999). Dynamic Noncooperative Game Theory, SIAM Edition, SIAm’s Classics in Applied mathematics, Society for Industrial and Applied Mathematics, Philadelphia.
Haurie, A. (1976). A note on nonzero-sum differential games with bargaining solutions.
Journal of Optimization Theory and Applications, 18, 31-39.
Jrgensen, S. and D. W. K. Yeung (2005). Overlapping Generations Stochastic Differential Games. Automatica, 41, 69-74.
Petrosyan, L. A. (1997). Agreeable solutions in differential games. International Journal of Mathematics, Game Theory and Algebra, 7, 165-177.
Petrosyan, L. A. and N. N. Danilov (1982). Cooperative Differential Games and Their Applications. Izd. Tomskogo University. Tomsk.
Petrosyan, L. A. and D. W. K. Yeung (2007). Subgame-consistent Cooperative Solutions in Randomly-furcating Stochastic Differential Games. International Journal of Mathematical and Computer Modelling (Special Issue on Lyapunov’s Methods in Stability and Control), 45, 1294-1307.
Yeung, D. W. K. (2011). Dynamically Consistent Cooperative Solutions in Differential Games with Asynchronous Players’ Horizons. Annals of the International Society of Dynamic Games, 11, 375-395.
Yeung, D.W. K. and L. A. Petrosyan (2004). Subgam,e consistent cooperative solutions in stochastic differential games. Journal of Optimization Theory and Applications, 120(3), 651-666.
Yeung, D. W. K. and L. A. Petrosyan (2005). Subgame consistent solution of a cooperative stochastic differential game with nontransferable payoffs. Journal of Optimization Theory and Applications, 124(3), 701-724.
Yeung, D.W. K. and L. A. Petrosyan (2006). Cooperative Stochastic Differential Games. Springer-Verlag. New York.