Contributions to Game Theory and Management, XIII, 296-303
The Dynamic Nash Bargaining Solution for 2-Stage Cost
Sharing Game*
Li Yin
St.Petersburg State University, Faculty of Applied Mathematics and Control Processes, TJniversitetskii prospekt 35, St.Petersburg, 198504, Russia E-mail: [email protected] Home page: http://www.liyin.site
Abstract The problem of constructing the Dynamic Nash Bargaining Solution in a 2-stage game is studied. In each stage, a minimum cost spanning tree game is played, all players select strategy profiles to construct graphs in the stage game. At the second stage, players may change the graph using strategy profiles with transition probabilities, which decided by players in the first stage. The players' cooperative behavior is considered. As solution the Dynamic Nash Bargaining Solution is proposed. A theorem is proved to allow the Dynamic Nash Bargaining Solution to be time-consistent. Keywords: Dynamic Nash Bargaining, dynamic game, minimum cost spanning tree.
1. The Model
In the paper, we consider a 2-stage game with spanning tree.
* H = (Z, F) is a finite game tree with the initial vertex zi.
* Z is a set of vertexes in the game tree.
* F(z1) is a point-to-set mapping: F(z1) C Z.
* In the second stage, F(z1) is the set of vertexes on the tree-like graph.
*i.e. F(zi) = Z \ {zi}.
* m(z1) is the number of elements in the set F(z1).
* r(z1) is the game starting from initial vertex z1.
* Similarly, r(zk) is the subgame starting from the vertex zk € F(z1).
* N = {1,2,..., n} is a finite set of players.
* N' = N u{0}. {0} is the source.
* G(N', E) = {(i, j) : Vi, j € N'} is a graph over N'.
* E is the set of all edges.
* If 3(i1,i2),(i2,i3), ..., (in-1, in); such that (ifc, ife+1) € G(N', E), 1 < k < n-1, and i1 = i, in = j, then two vertexes md j € N' are said to be connected in G.
* If all i, j are connected in G, a graph G is called connected over N', Vi, j € N'.
* Gn' is the set of connected graphs over N'.
Definition 1. The cost of connections is represented by a cost matrix (Li, 2016)
Cm = (cij )(n+1)x(n+1), (1)
where cij = cji > 0 is the cost of conn ecting i and j, i = j € N '.In the paper, cio = coi is a nonnegative constant, and cost matrices are nonnegative, symmetric.
* This work was supported by the China Scholarship Council (No.201508090030).
The Dynamic Nash Bargaining Solution for 2-Stage Cost Sharing Game 297 Definition 2. At each stage, player i chooses a vector
xi (xi,1 ? • • • ? xi,i-1 ? xi,i+1 ? • • • ? xi,n ) ?
where xij G Xij is a strategy of player i against player j. Similarly, xjji G Xjji is a strategy of player j against player i. At different stages, the set Xij may be different for the player i, Vi, j G N
Definition 3. At each stage, the cost of edge (i, j) is defined as
cij = cji = fc(xi,j, xjii), Cio = coi > 0, Vi, j G N. (2)
where function fc is a mapping from strategies of players i, j to the set R+ U
(i, j)
Definition 4. Tx(N', Cm) is the minimum cost spanning tree (m.c.s.t.) (Bird, 1976) over N'
Tx(N',Cm) = arg min cj
GgGn ' —
(i,j)eG(N ',E)
where Cm = (cij)(n+1)x(n+1) is the cost matrix.
Definition 5. C[Tx(N',Cm)] is the total cost of edges in the m.c.s.t. (Bird, 1976) Tx(N ',Cm )
C[Tx(N', Cm)] cij (3)
(i,j)ETx (N',Cm )
2. Description of the Game
2.1. Stage 1
Players simultaneously choose their behaviors, i.e. n-dimensional strategy profiles
x1(z1) =(x1 (z1),...,xn(z1)),
x1(z1 ) = (xг1,1(z1), . . . x1,i+1 (z1^ . . . ,x1,n(z1 ))'
where x1,j-(z1) G Xi1j is a strategy of player i against player j, Vi = j, i,j G N By definition (1)(2), this means that at stage 1 player i and player j choose their strategies x1,j-(zi^d xj,i(z1) and build an edge (i,j), and cj = j =
fc(xi,j (z1 ),x1,i(z1))-
2.2. Stage 2
The game proceeds to the second stage with probability, which depends on strategies of players chosen on stage 1. The transition probabilities are defined as following
p(z1,zfc,x1(z1), ... ,x^(z1)) = p(z1, Zk,x1(z1)) > 0,
£ p(z1,zk ,x1 (Z1)) = 1 (5)
zk£F (zi)
where p(z1, zk, x1(z1)) is the probability that the game moves from initial vertex z1
zk
Assume that each vertex zk G F(z1) is associated with a matrix called a-matrix.
Definition 6. The a-matrix of the stage game in vertex zk is defined as
a[p(z1,zfc ,x1(z1))]
/to 1 1
1 to a1;2 1 «2,1 •••
1 «n-1,1
\ 1 a„,1
11
«1,n-1 «1,n
to an- 1,n an,n-1 ^^
,zfc € F(z1)
1or + to,Vi = j € {1, • • • ,n}
where x1 (z1) are the strategies of players in the previous stage game starting from initial vertex z1.
Definition 7. For two matrices A and B with the same dimension (m x n), the Hadamard product A o B (Horn, 2012) is a matrix which elements are given by
«11 x 6n • • • a.1n x 61^
«11 • • • «1n\ /611 • • • 61n\
A o B = |......I o I......I = I ••• ••• 1 (6)
• • • «ran J \6m1 • • • 6mn/ \«m1 x 6m1 • • • «ran x 6mn/
The cost matrix of the stage game in the vertex zk € F(z1) is defined as follows
Cm(zfc) = a[p(z1,zfc,x1 (z1))] o {c2j}(n+1)x(n+1), c2,- = j = fc(x2, j (zfc),x2 i(zfc)), Cio = coi > 0
(7)
where x2,,(zk) € X2, is a strategy of player i against player j, Vi = j,i,j € N, zfc € F(z1).
Example 1. The Fig. 1 shows, how the strategy profiles x1 (z1) can influence the game played in the second stage.
Zl x1(z,) Stage 1
Z2 H zk zm+1 Stage 2
p(z1,z2,x1(z1)) p(z1,z3,x1(z1)) p(z1,zk,x1(z1)) p(z1,zm+1,x1(z1))
Fig. 1. The diagram of the 2-stage game with spanning tree
3. Cooperative Game
In 2-stage m.c.s.t. game assume that the total cost of players is the sum of the
cost of players on both stages (Parilina, 2015).
During the game suppose that the path z', z'' is realized. Let xj(z') = x1(zi) and xj(z'') = (x2(z2),..., x2(zk)),i G N, zk G F(z1). The cooperative solution of the game r(z1) for the set N' at the first stage is defined as follows:
V 1(N ',zi)
= min{C[Tai(zi)(N',Cm(zi))]+ £ p(zi,zfc,x1(zi))C[T,2(zfc)(N', Cm(zfc))]}
zfc eF(zi)
= C [Txi(zi) (N' ,Cm(zi))]+ p(zi,zfc ,xi (zi))C [T,2(zfc) (N',Cm (zfc))],
zk eF (zi)
Vi(0,zi) = ü,zfc G F(zi), (8)
where p(zi,zk,xi(zi)),k G {k : zk G F(zi)} are defined in (5), Vi(N',zi) is the value of characteristic function for set N' in the game r(zi).
Strategies Xj(•),« G N are called cooperative strategies, and strategy profile x(-) = (Xj(•),« G N) - cooperative strategy profile.
3.1. The Value of the Game for the Player in the Game r(z1)
The value of the game for the player i G N is defined as the value of the zero-sum game in which player i plays against players from N\ {i}. In the zero-sum game, all players in N \ {i} don't want to be connected to the source. Thus in this situation assuming that players in N \ {i} are out and the m.c.s.t. contains only one edge -(i, 0), i G N, which means that the cost of this unique edge in each stage game is the cost of m.c.s.t. of this stage.
V i({i}',zi) =
min {C [Txi(zi) ({i}',C»' (zi))]+ £ p(zi,zfc ,xi(zi))C [T> (zfc) ({i}',C»' (zfc))]}
zfc eF (zi)
= C [T;Si(zi)({i}',C«' (zi))]+ £ p(zi,zfc ,Xi (zi))C [T,2(zfc)({i}',Cij>' (zfc))]
zfc eF (zi)
= Cjo + £ P(zi,zfc ,Xi (zi))cjo = 2cj0
zfceF (zi)
' (9)
where p(zi;zk,xi(zi)),k G {k : zk G F(zi)} are defined in (5). C{j} (zi) and C{j} (zk) is the cost matrix restricted to {i}' and is determined by (1) and (7). It means that C{j} (zi^d C{j} (zk) are sub-matrices of matrix C(zk).
3.2. The Value of the Game for the Player in the Game r(zk)
Suppose that the subgame r(zk) happened in the vertex zk G F(zi) of the tree-like graph H = (Z, F).
According to the definition (5), p(zi; zk, xi(zi)) - the transition probability to proceed from initial vertex zi to the vertex zk. Thus, the cooperative solution of the subgame r(zk) for the set N' at the second stage is defined as follows:
V2(N',zfc) =min C p>(zfc)(N',Cm (zfc))] = C [Tg2(zfc) (N' ,Cm (zfc))], V 2(0,zfc) =0,
where C(zk) is defined % (7). Strategies x2(^),i e N are a cooperative strategies. Strategy profile x2(-) = (x2(•),« e N) is a cooperative strategy profile. V2(N', zk) is the value of characteristic function for set N' in the subgame r (zk).
In a similar way, the value of the game for the player i e N is defined as following
V2({i}', zk) = nmin C[Tx2(zfc)({i}',C«'(zfc))]
x2() (10) = C [T^k) ({i}',C»' (zk ))]= cio,
where the subgame in the vertex zk with the probability p(z1; zk, x1 (zi)), zk e F(zi) which are defined in (5), C{i} (zk) is the cost matrix restricted to {i}'.
4. The Dynamic Nash Bargaining Solution
Let (H1(z1),..., Hn(z1)) e Si be cost vector in the game r(z1), the set of all possible costs is defined as
S1 = {Hi(z1): Hi(z1) > 0, i e N} (11)
and the value of the game for each player is V 1({i}', z1) = 2ci0, i e N. Here, S1 is bargaining set, and V 1({1}, z1),..., V 1({n}, z1) e S1 - disagreement point. Consider the following expression:
n
max TT[V 1({i}',z1) - Hi(z1)]
Hi (zi)<V1 ({i}' ,z1),(H1(z1),...,Hn
i=1
n
= n[V 1({i}',z1) - H(z1)],i e N.
i=1
Vector (H 1(z1),..., Hfn(z1)) is called Nash bargaining solution.
At the second stage, if the game proceeds to the stage game on the vertex zk e F(z1) with probability p(z1; zk,x1(z1)), the set of all possible costs is defined as
Sk = {Hi(zk) : Hi(zk) > 0, i e N.} (12)
and, the Nash bargaining solution in one stage m.c.s.t. game is defined as follows:
H[V2({i}',zfc) - Hi(zfc)]
Hi(zk)<V2({i}',zk),(Hi(zi),...,H„(zi))eSfc J
i= 1
n
= ![[V2({i}',zk) - Hi(zfc)],i G N.
i=i
Time consistency of the cooperative solution concept was introduced for the first time in (Petrosyan, 2006).
Using the IDP (Imputation Distribution Procedure) the Dynamic Nash Bargaining Solution is constructed (Junnan, 2018).
Definition 8. Imputation distribution procedure of the Nash Bargaining Solution in 2-stage m.c.s.t. game is a scheme ft = (^1, ^2) s.t.
# = Hi(zi) - ]T p(zi,zfc, x1(zi))Hi(zfc), Vi G N
zkeF (zi)
0
Fig. 2. The figure on the left side is the tree-like graph of the game. The figure on the right side is the graph at each stage game.
= ^ p(zi(zi))iHi(zk), Vi G N, zk (zi)
where p(z1, zk, x1 (z1)) is the transition probability from the initial stage to the stage game r(zfc), zfc G F(zi).
Definition 9. The Nash Bargaining Solution (z1), i G N is called time consistent in the game with spanning tree, if there exists a nonnegative IDP (fl1 > 0, fl2 > 0, Vi G N) such that the following condition holds:
Hi(Z1 ) = fl1 + ^ p(z1 ,Zk,x1 (Z1 ))Hi(zk), Vi G N, (13)
Zk GF (zi)
p(z1 ,zk,x1 (z1 ))HHi(zk) = fl2, Vi G N, (14)
Zk GF (zi)
where p(z1, zk, x1 (z1)) is the transition probability from the initial stage to the stage game r(zk), zk G F(z1).
Unfortunately, in 2-stage m.c.s.t. games the IDP fl may take negative value.
Proposition 1. Constructed above IDP fl for the Dynamic Nash Bargaining Solution (H (z1),..., (z1)) is time inconsistent.
We propose a counterexample in order to verify the proposition.
Example 2. In this example, we consider a two-person game with spanning tree as an illustration in Fig.2.
The set of players is N = {1, 2}, and the source is {0} N' = N U {0}. The sets of strategies, which player 1 uses against player 2, are X-j12 = {3, 4} X22 = {6, 7}, and the sets of strategy, which player 2 uses against player 1, are X^ = {6, 2}, X| 1 = {8,3}. Assume that there are two vertexes z2, z3 following after the initial z1
As shown in Fig.2, in each stage there is a graph over N'. Assume that edges (0,1) (0, 2) are fixed and the cost of edges are co1 = c1o = 80, c02 = c20 = 10. The function fc is defined as fc = x1j2 x x2)1, x1j2 G X1)2, x2j1 G X2j1.
The matrices of the stage game in vertexes z2 and z3 are described as follows
/ro 1 A /ro 1 i\
a[p(z1 ,z2 »x1 (z1))] = I 1 ro 1 I , a[p(z1, z3 ,x1 (z1))] = I 1 rorol \ 1 1 roj \1 roroy
where x1(z1) G Xi1 is the strategy profile of players in the first stage, i G N.
In the case of different strategy profiles of players, the game's transition probabilities are
x1 (z1 ) = (3, 6) : p(z1,z2,x1(z1)) = 0.5,p(z1, Z3, x1 (Z1)) = 0.5,
x1 (z1 ) = (3, 7) : p(z1,z2,x1(z1)) = 0.7,p(z1, z3, x1 (z1)) = 0.3, x1 (z1 ) = (4, 6) : p(z1,z2,x1(z1)) = 0.9,p(z1, z3,x1 (z1)) = 0.1, x1(z1) = (4, 7) : p(z1,z2,x1(z1 )) = 0.15,p(zb z3, x1 (z1)) = 0.85.
According to above-mentioned analysis, in 2-stage game we get the value of the game for each player in the game r(z1):
V1 ({1}',z1) = 160,V 1({2}',z1) = 20
The value of the game for each player in the the subgame r (z2):
V 2({1}',z2) = 80,V 1({2}',z2) = 10
The value of the game for each player in the the subgame r (z3):
V 2({1}',z3) = 80,V 1({2}',z3) = 10
The value of characteristic function for set N' in the game r(z1):
V 1(N ' ,z1) = 57.4 The Nash bargaining solution in the game r(z1),
H1 (z1) = 57.4, H2(z1 ) = 0 The Nash bargaining solution in the game r (z2),
H 1(z2) = 16,i?2(z2)=0 The Nash bargaining solution in the game r (z3),
H 1(z3) = 80,HH2(z3) = 10
x1(z1) = (x1 (z1),x2(z1)),x1(z1) = 4,x2(z1) = 6 p(z1,z2,x1 (z1 ) ) = 0.9,p(z1,z3,x1(z1)) = 0.1 We construct IDP ft for the Dynamic Nash Bargaining Solution,
= H2 (z1) - p (z1,z2) H2 (z1) - p (z1,z3) H2 (z3)
= 0 - 0.9 x 0 - 0.1 x 10 = -1 < 0 The Dynamic Nash Bargaining Solution in the example is time inconsistent.
5. Results
Theorem 1. If in a 2-stage game with spanning tree r(z1), the following conditions hold
Hi(z1) > fii(zfc), Vi G N (15)
then, the Dynamic Nash Bargaining Solution (H1(z1),..., Hn(z1)) is time consistent.
Proof The proof follows immediately from the definition of fl1 and fl2.
Multiply both side of (15) on p(z1; zk, x(z1)) > 0 and taking in account that
£ p(z1,zfc,x (z1)) = 1, zk GF (zi)
We obtain
p(z1, zfc, x(z1))Hi(z1) > p(z1, zfc, x(z1))Si(zfc), zfc G F(z1), Vi G N
thus
£ p(z1,zfc,x(z1))Hi(z1) > £ p(z1,zfc,X(z1))Hi(zfc),
zk GF(zi) zkGF(zi)
zfc G F(z1), Vi G N
then
Hi(z1) > fl2 > 0, Vi G N
and
fl1 = Hi(z1) - fl2 > 0, Vi G N the theorem is proved □
References
Li, Yin (2016). The dynamic Shapley Value in the game with spanning tree. Stability and Oscillations of Nonlinear Control Systems (Pyatnitskiy's Conference). 2016 International Conference. IEEE. pp. 1-4. Bird, C. G. (1976). On cost allocation for a spanning tree: a game theoretic approach.
Networks, 6(4), 335-350. Horn, R. A., Johnson, C. R. (2012). Matrix analysis. Cambridge university press. Petrosyan, L. (2006). Cooperative stochastic games. Advances in dynamic games,
Birkhauser Boston, 139-145. Junnan, J. (2018). Dynamic Nash Bargaining Solution for two-stage network games. Contributions to Game Theory and Management, 11(0), 66-72. Parilina, E. M. Stable cooperation in stochastic games. Autom Remote Control, 76, 1111— 1122.
Granot, D., Huberman, G. (1981). Minimum cost spanning tree games. Mathematical programming, 21(1), 1-18. Petrosyan, L. A. (1977). Stability of the Solutions of Differential Games with Several Players. Vestnik of the Leningrad State University, 19, 46-52.