Contributions to Game Theory and Management, XIV, 38-48
Non-Zero Sum Network Games with Pairwise Interactions*
Mariia A. Bulgakova
St. Petersburg State University, 7/9 Universitetskaya nab., Saint Petersburg, 199034, Russia E-mail: mari_bulgakova@mail .ru
Abstract In the paper non-zero sum games on networks with pairwise interactions are investigated. The first stage is network formation stage, where players chose their preferable set of neighbours. In all following stages simultaneous non-zero sum game appears between connected players in network. As cooperative solutions the Shapley value and r-value are considered. Due to a construction of characteristic function both formulas are simplified. It is proved, that the coefficient A in r-value is independent from network form and number of players or neighbours and is equal to 2 ■ Also it is proved that in this type of games on complete network the Shapley value and r-value are coincide.
Keywords: cooperative games, network games, dynamic games, the Shapley value, r-value.
1. Introduction
The theory of cooperative network games is an important part of modern game theory that will be used to construct solutions in games on networks. This theory-includes the cooperative trajectory, the strategies that generate it, the payoff along the cooperative trajectory, as well as the distribution of payoff between the players and the analysis of the dynamic stability of solutions (Petrosyan, Danilov, 1979).
The network illustrates the interaction between players and the possibility of cooperation. The main attention is given to cooperative behavior of player that is behavior in which the joint payoff of players will be maximal.
The principle of pairwise interaction used in this article was first introduced in (Dyer, Mohanaraj, 2011) with the aim of finding a Nash equilibrium in nonzero-sum game. This principle implies splitting the game into a family of simultaneous games between pairs of players — the vertices of the same edge in the network. In (Bulgakova, Petrosyan, 2015) the first time the principle of pairwise interaction was applied to cooperative two-stage games. The paper (Bulgakova, Petrosyan, 2019(a)) is devoted to finding the basic solutions of two-stage cooperative games with pairwise interaction and also the supermodularity of the characteristic function is proved. This property significantly increases the value of such a solution as the Shapley value, since in the case of a convex game it always belongs to the core. In this regard, in games based on pairwise interaction, this solution is of particular interest, also because of possibility to simplify the corresponding calculations. Considering the construction of the characteristic function, another solution proposed in (Tijs, 1987), namely, r-value.
In this paper, a special type of multi-stage cooperative games on the network is considered, which is distinguished by the way of constructing the characteristic function. Like in (Bulgakova, Petrosyan, 2019(b)), first the cooperative trajectory
*This work was supported by the Russian Science Foundation (project No.17-11-01079). https://doi.org/10.21638/11701/spbu31.2021.03
is found, and then the characteristic function is calculated taking into account the cooperative strategies of players. As a solution, the Shapley value and r-value are considered, for the latter, the coefficient A is calculated, which is the same for any number of players and for any network design in a given game. It is also shown that in this game the Shapley value and r-value coincide. The illustrative example is presented.
2. The model
Let an abstract finite space Z be given, called the state space. On the first stage in state (position) z0 G Z, players form the network g(z0), where the vertices are players, and edges are connections between them. In every following state zk G Z players can change the network by cancelling connections, and after playing n-person non-zero sum game r (zk) on network g(zk).
Define the rule of network formation g(z0) on the first stage similar to [8]: in the initial state z0 every player i G N choose to behavior bi(z0) = (bn(zo),..., bjn(z0)) — n-dimensional connection vector with components taking values 0 or 1. Introduce following notations: Mj C N \ i — those players, whom player i G N is offering connection. The value aj G {0,..., n — 1} is maximal number of player i connections on every stage. If Mj = N \ {i}, player i can offer a connection to all players, and if a = n — 1, player i can support any number of possible connections. Thus, each player is restricted by the number of connections a.j, which he has to offer, and set Mj of players, available for creating connections. In other words, on the first stage control yj(z0) is vector of connection offers bj(z0).
Player i chose subset of players Qj C Mj, which he want to see as his neighbors.
Then components of the vector bj(z0) are defined in following way:
, / n /1, if j G Qj, m
bjj(zo) = \0,ifj G Qi or i = j, (1)
under additional condition
y^ bjj(zo) < aj. (2)
jeN
Condition (2) means, that the number of possible connections is limited for each
player. Note, that |Qj| < aj. The connection can be realized only with a player from Q
The connection ij is realized if and only if bj(z0) = bjj(z0) = 1, i. e. i G Qj, j G Qj. Formed connections ij create edges of the network g(z0), where players are
vertices, i. e., if bj = bjj = 1, then in the network g an edge with end vertices i and j
Denote by Nj(g(z0)) neighbors of player i in the network g(z0), i. e. Nj(g(z0)) = {j G N \{i} : ij G g(z0)}.
After network g(z0) formation players pass to state zi(g(z0)), which depend on network g(z0). In state z1(g(z0)) players are given the opportunity to remove some of previously established connections, thus rebuilding the network g(z0) in g(z1) and forming new sets of neighbors N(g(z1)^. On the network g(z1) players play the game r(z1), which is a simultaneous non-zero sum game between neighbors on the network.
On the second stage, in the state z1 player i, i = 1,n, chooses control yj (z1) = (b (z1), xj (z1)) from finite set Y, which, unlike the first stage, contains an additional
component xj(zi) — behavior in game r(z^. And bj(zi) is a vector with components 0 or 1, obtained by the following rule:
, ( , J1, save link ij, ,
bij (zi)= \0, delete link j, W
i. e. the player in the second stage has the ability to delete existing links, however he does not have the ability to create new. xi(z1) is behavior of player i in game r(z1) and it is chosen from finite set Xi(z^, defined in state z1.
Let y(z1) = (y1(z1),..., yn(z1)) be a strategy profile in game r(z1). Payoff of player i in game r(z1) equals to:
Hi(zi) = hi(yi(zi), yj (zi)),
jeNi(s(zi))
where g(z1) is network, resulting from the strategy profile y(z1), which provides the ability to remove some edges from the network g(z0), the functions hi(xi(z1),xj(z1)) > 0 are given for all i G N and for all pairs ij, i. e. for all edges of network g(z1) and all possible states z G Z.
Suppose that in state zk-1 G Z in game r(zk-1) players i G N chose controls (y1(zk-1),..., yn(zk_1)). As a result of the choice of these controls, the transition in a state zk takes place, where the game r(zk) happens, with payoffs hi(xj-(zk), xi(zk)). That is, the state at the next stage of the game depends on the state at the current stage and on the controls selected at this stage. We can define the mapping T : Z x Y1 x Y2 x ... x Yn ^ Z by the formula
zk = T(zfc_i;yi(zfc_i),y2(zfc_i),... ,yn(zfc_i)), k = 1,1 (4)
The mapping T uniquely identifies the state zk, which follows the state zk_1, provided that the controls have been selected yi(zfc_i),y2(zfc_i),... ,yn(zfc_i) at state zfc_i.
Consider multistage game G(z^ which ^^^^^^s as follows. The game G(z0) starts in state z^. In state z0 the network g(z0) is being formed, after the players pass to the state z^ In state zk_1, k = 1,1 — 1 players choose controls y1(zk_1), y2(zk_1),..., yn(zk_1^, play game r(zk_1) and pass to the state zk = T(zk_i; yi(zk_i),y2(zk_i),..., yn(zk_i)) Game ends on stage I +1 in state z£. Thus, as a result of control choices at each stage of the game, the path z0, z1,..., zk,..., z^ is realised. z
states generated by these controls z0, z1,... zk, k < I, such that zk = z.
The concept of a strategy in the resulting multistage game is introduced in a natural way: yi(^), i G N, — as a rule, which to each admissible state z of game matches components bi(z),xi(z) controls in this state, i. e. selection of links to remove and selection of behavior xi(z) in game r(z). From the above description it follows that any strategy profile y() = jy1(^),..., yn(^)} uniquely determines the path in the game, and, consequently, the payoff of each player as the sum of his payoffs in games realized along the path:
i
Hi(y()) = E m hi(yi(zk ),yj (zk)).
k = 1 j£Ni(S(z))
Note that the set of all possible paths in the multistage game G(z) is finite, and thus the set of all admissible states in the game is finite. We denote this set by
zcz.
Suppose players choose controls y^z), i G N, which maximize their joint payoff in the game G(z), i. e.
i i
Hi(yi(zfc),..., yn(zfc)) = ma^^ Hi(yi(zk),..., yn(zfc)). (5)
k=i ieN y k=i ieN
Strategy profile y = (y1 ,...,yn) will be called the cooperative behavior in the game G(z), ^d trajectory (z0, zi,..., zi) which corresponds to the controls yi(z), i G N, — we will call cooperative trajectory (z0 = z0).
Consider single stage game r(z) in arbitrary state z G Z in a cooperative form and define its characteristic function v(S; z), S c N, for every subset (coalition) S c N according to the rule:
v(0; z) = 0, (6)
v({i}; z) = 0, (7)
v({j)• z) = i hi(xi(z); (z)) + hj(xj(z); xi(z))' if j G Ni(g(z))' (g)
({ j}; ) [ 0, in other case, ^ '
v(S;z) = ^ Y^ hi(xi(z); (z)) (9)
ieS jeNi(g(z))rS
v(N; z) = £ ]T hi(Xi(z); Xj(z)), (10)
iEN jeNi(g(z))
where Xi(z),Xj(z) are obtained according (5).
v(S; z)
and corresponding game is convex (see Bulgakova, Petrosyan, 2019(a)).
Here, unlike the previous approaches, in which the characteristic function was
S
additional coalition N \ S, the process of construction of this function is carried out as follows. To calculate the value of the characteristic function, it is necessary to determine the cooperative behavior in the game G(z0) and then calculate v(S; zk), k = 1,1, under the assumption that players choose cooperative behavior as a control component.
Find the characteristic function V(S; zk) of multistage game G(zk), started in zk S
trajectory (y(z0), y(z1),..., y(z;)) in I — k + 1 stages, starting from k:
i i
V (S; zk) = v(S; zr ) = hi(Xi(zr ),Xj (zr)),
r=k r = k ieS jeNi(g(zr ))rS
V(S; zi) = v(S; zi).
3. The T-value
Consider as a solution for game G(z0) the t-value (Tijs, 1987). Start with a special case — single stage game r(zi) which appears after network formation stage.
Proposition 1. In the game r(z^, T-value equals to:
Tj(N, v, zi) = 1(v(N, zi) - v(N \ {j}), zi) =
= 1 E (hi(xî(zi), (zi)) + (xi(zi ),xj (zi))).
ieWj (g(zi))
Proof. We use values of characteristic function for a single player and maximal coalition N (6), (10):
Components of T-value for convex game can be calculate by the formula:
Tj(N, v) = A(v(N) - v(N \ {j})) + (1 - A)v({j}) where coefficients A is defined from equation
E (A((v(N) - v(N \ {j})) + (1 - A)v({j})) = v(N) jew
Since v({j}) = 0 (see (7)), the term (1 - A)v({i}) also is equal to zero.
Therefore, for two-stage game with pairwise interactions we have following formula for calculating T-value:
Ti(N, v, zi) = A(v(N, zi) - v(N \ {i}), zi) (11)
A
E A(v((N, zi) - v(N \ {j}), zi) = v(N, zi) (12)
jew
Calculate the values of the difference v(N, zi) - v(N \ {j}, z^. If player j does N N j N
loses payoff on these connections. So the considered difference will be equal:
v(N, zi) - v(N \ {j}, zi) = E (hj(Xj(zi), Xj(zi)) + hj(xj(zi),xj(zi))).
ieNj (g(zi))
Now go back to the equation (12), and substitute into it the values of the characteristic function.
EA( E (hi(xi(zi),xj(zi)) + hj(xi(zi),xj(zi)))) =
jew ieNj (g(zi))
= E E hi(xi(zi); xj (zi)) (13)
jeWieWj (g(zi ))
We get
( (hi(Xi(zl),Xj (zi))+ hj (xi(z1 ),xj (zi))))
jew ¿eWj(g(zi))
= ^ hi(xi(zi);xj(zi)) (14) jeNieNj (g(zi))
Due to double summation on the left side of equality (hi(Xi(zi),Xj(zi)) + hj(xi(zi),Xj(zi)), and since the summation is carried out over i and j, we get:
A • 2v(N,zi) = v(N,zi) (15)
Cancelling the same multipliers on the right and left sides of the equation, we
get:
A =2 (16)
Thus, in cooperative network game r(zi) with pairwise interactions,, coefficient n r-va mber c get that
A in T-value is equal to 2> and does not depend on the number of players, or on the number of connections between them, or on the structure of the network. And we
Tj (N,v,zi) = 2 ^ (hi(xi(zi),xj (zi)) + hj (xi(zi),xj (zi))). (17)
ieNj (g(zi))
Proposition is proved.
k
T
Now we can find the T-vdue for multistage game G(zk), starting from arbitrary
zk
1 i
Tj(N, v, z0) = 2 (hi(Xi(zk),Xj(zk)) + hj(Xi(zk),Xj(zk))). (18)
r=kieNj (g(zk))
4. The Shapley value
Now consider another solution, the Shapley value, and compare it with previous on the special network — complete network.
Proposition 2. In multistage network game G(z0) with pairwise interactions on
T
Proof. Earlier, we obtained a simplified formula for the components of the Shapley value for a two-stage cooperative game with pairwise interaction on complete network:
^iM = 2 mij'
jeN
where mij is maximal joint payoff for pair ij which create an edge in network. Now consider how we can simplify formula for calculating components of the Shapley value in considered game. As previous, we start from single stage game r(z0) and then generalize the result to multistage game.
For components of the Shapley value we will use following formula (Shapley, 1953):
Vj[v]= E (t - ^ - t)! [v(T) - \ j)] (19)
t jeTcN !
Calculate the difference in values of characteristic functions, v(T) — v(T \ i), when T = N For arbitrary coalition T we will have:
v(T, zi) — v(T \ {j}, zi) = E (hi(xi(zi),Xj(zi)) + hj(xi(zi),xj(zi))).
ieNj (g(zi))nT
Note, that in (19), in the term with a fixed value (t), the sum (hi(xi(zi), xj (zi)) + hj(xi(zi),xi(zi))) will occur exactly 222 times. As a result we have:
Vj[v] =
= E (t — 1)i(!n — t)! ■ Cn—22 ■ E (hi(Xi(zi),Xj(zi)) + hj(Xi(zi),xi(zi))).
T |jeT CN ! ieNj (g(zi))nT
(20)
And, simplifying, we get:
n t - 1
Vj[v] = E ^fe^x(zi)) + hjfe^ax(zi)))-En.(—_ 1) (21)
ieNj(g(zi))nT t=2 n ' (n )
Vj [v] = E (hi(xi(zi), xj(zi))+hj(xi(zi),Xj(zi)))-(n , 1 1) + ^(n 2 1)
I * I n 1) It 2
ieNj (g(zi ))nT
(22)
Finally we have:
Vj[v] = 1 ■ E (hi(xi(zi),xj (zi)) + hj (Xi(zi),xj (zi))) (23)
ieNj (g(zi))nT
Thus r-vdue coincides with the Shapley value in game r(zi) on complete network. In multistage game G(z0) we have following formula for components of the Shapley value:
1 1
Vj[v] = 2 "E E (hi(xi(zo)
, xj (z0 )) + hj (xi (z0 ), xj (zo))) (24)
r=fc ieNj (g(zo))nT
which also coincides with r-value on complete network.
5. Example
Consider the case, where N = 3 I = 3, i. e. the game consists of four stages and starts in the state z0. In this state set Mj of players, whom player i can offer a connection, is given
Mi = {2, 3}, M2 = {1, 3}, M3 = {1, 2},
as well as restrictions on the number of connections that each player can support:
ai = 1, a2 = 1, 03 = 2.
In state z0 players choose behaviors bi(z0) — formed a network g(z0) and then pass to the state z2. In every state zk, k > 1 players choose controls yi(zk) = (bi(zk),xi(zk)), where bi(zk) — vector regulating player connections (with components 1 or 0), and xi(zk) is defined as
Xi(zk) = Xi(z) G Xi = {xi(z), xi(z)},
X2(zk) = X2(z) G X2 = {xi(z),x2(z)}, X3(zk) = X3(z) G X3{x3(z), x^z)}
Thus every player i has a given set of control components Xi in all states zk.
For admissible states zk, k > 1, and all possible strategies payoffs are given hi(xi(zr),Xj(zr)) in following way: hi(xi(zr),Xj(zr)) and hi(xi(zr),Xj(zr)).
In state zi the game happens with payoffs h(xi(zi),xj(z2)). In state zi every player i G N chose his control component xi(zi^, and if all xi(zi) = xi(zi),i G N,
z2
hi(xi(z2),Xj(z2)). If at least one of the components xi(zi) = x2(zi),i G N, then in state z2 players play a game with payoffs hi(xi(z2),xj(z2)). In a similar way, the transition to the state z3: if all xi(zi) = xi(zi),i G N, then the players in z3 use payoffs hi(xi(z3),xj(z3)), again if at least one of the components xi(zi) = x2(zi),i G N, _ payoffs are hi(xi(z3),xj(z3)). Payoffs hi (xi (zr), Xj (zr)):
hi(xi, xi!) = 4 hi(xi, X3) = 5, h2(xi!, X3) = 5, hi(x^x,!,) = 3 hi(xi,x3) = 3 h2(x2,xi,) = 1, hi (xi, x2) = 5, hi(xi, X3) = 1, h2(x2, x3) =4, hi(x2,x2) = 5 hi(xi,x3) = 2 h2(x2,x3) = 1, h2(x2, x];) = 4 h3(x3, xi) = 5 h3(x3, x2) = 5, h2(x2,X2) = 3 h3(x3,xi) = 3 h^x^x2) = 1, h2(x2,xi) = 5 h3(x3, xi) = 1, h3(x2, xi) = 4, h2(x2,x2) = 5 h3(x3,xi) = 2 h3(x3,x2) = 1;
payoffs hi(xi(zr),Xj(zr)):
hi(xi,x2) = 8 h'i(xi,x3) = 6 h2(x2,xi) = 12,
h2(xl,x2) = 3 hi(xl,x3) = 5 h2 (x2, x3) = 10)
h1(x1, x2) = 7 h1(x1, x3) = 4 h2(x2, x3) = 5J
h'i(x2,x2) = 4 h'i(x2,x3) = 3 h2(x2,x3) = 4, h2(xi,xi) = 8 h3(xi,x1) = 6 h3(x3,x2) = 12, h2(x2,x1) = 3 h3(x1,x1) = 5 h3(x3,x2) = 10) h2(x2, x1) = 7 ^fe x1) = 4 h3(x3, x2) = 5)
h2(x2,x2) = 4 h3(x2,x2) = 3 h3(x2, x2) = 4.
z0
of all players:
bi(z0) = (0,0,1), b2(z0) = (0,0,1), 6a(z0) = (1,1,0) As a result, a network (see fig. 1) is formed.
Fig. 1. Network 011 the first stage of game.
Tb maximize the joint payoff, it is beneficial for the players to keep connections with all neighbors throughout the game, i. e. 6j(z0) = h(zi) = 6j(z2) = &j(z3), for all i G N Components of the player controls y%(z)
xi(zi) = x2, x2(zi) = x3,, X3(zi) = x1,,
Xi(z2) = X1, X2(z2) = X2, X3(z2) = X3,
Xi(z3) = x1, X2(z3) = x2, X3(z3) = x1,.
Calculate the values of characteristic function v(S; z) in all states along cooper-zo
and players do not receive payoffs:
S {1} {2} {3} {12} {13} {23} {123}
zi) 0 0 0 6 6 10 16
v(S z2) 0 0 0 14 12 20 32
v(S z3) 0 0 0 14 12 20 32
Table 1. The values of characteristic function v(S; z).
zi
state. In each state, the players have only two alternatives: either, as a result of the choice of controls, they will play the game r(zk) with payoffs hj(Xj(zk), Xj(zk)) in the next, state, or pass in state, where the game will take place with payoffs hi(Xj(zfc ),Xj (zfc)).
Fig. 2. Tree of all possible states of game.
Numbers 1 and 2 above arrows (fig. 12) show, what payoffs will be used by-players in the next state: 1 means hj(Xj(zk),Xj(zk)), 2 means hi(Xj(zk),Xj(zk)).
Optimal trajectory in game G(z0): z = (z0, zj, z|) = (z0, zi; z2, z3). Calculate the values of characteristic function in multistage game G(z0):
S {1} {2} {3} {12} {13} {23} {123}
V(S; Z3) 0 0 0 14 12 20 32
V(S; Z2 ) 0 0 0 28 24 40 64
V(S; Zi ) 0 0 0 34 30 50 80
Table 2. The values of characteristic function in multistage game G(z0). Calculate r-value and the Shapley value in every stage
Ti(N,v,zi) = 1 (hi(x1,x2)+h2(x1,x1)+hi(x1,xi)+h3(x3,x2)) = 2(0+0+3+3) =3 T2(N,v,zi) = 1 (h2(x2,xi )+h i (x2,x2)+h2(x2,x1 )) = 2(0+0+5+5) = 5
T3(N,v,zi) = 1 (h3(x3,x1)+hi(x2,x3)+h3(x3,x1)+h2(x2,x1)) = 2(3+3+5+5) =8 Ti(N,v,Z2) = 2 (hi(x1,x2)+h2(x2,x1)+hi(x1,x1)+h3(x3,x1)) = 1(0+0+6+6) =6 T2(N,v,z2) = 2 (h2(x2,x1) + h'i(x1,x2) + h2(x2,x3) + h3(x3,x2)) =
= 2(0 + 0 + 10 + 10) = 10
T3(N,v,z2) = 2 (h3(x1,x1) + hi(x1,x3) + h3(x3,x2) + h2(x2,x3)) =
= 2(6 + 6 + 10 + 10) = 16
Components for Tj(N, v, z3) ^re equal to Tj(N, v, z2) because the payoffs are the same on these stages. In multistage game G(z0)
Tj(N,v,z0) = 3 + 6 + 6= 15 t21n,v, z0) = 5 + 10 + 10 = 25 t3(n, v, z0) = 8 + 16 + 16 = 40
T
6. Conclusion
We considered a special type of multi-stage cooperative games on the network which is distinguished by the way of constructing the characteristic function. The main point here is, finding the cooperative trajectory firstly, and then the characteristic function is calculated taking into account the cooperative strategies of
T
the coefficient A is calculated, which is the same for any number of players and for any network design in a given game. It is also shown that in this game the Shapley
T
Acknowledgements. We acknowledge for the support Russian science foundation (project No.17-11-01079).
References
Tijs, S.H. (1987). An axiomatization of the o-value. Mathematical Social Sciences, 13, 177-181.
Dyer, M. and V. Mohanaraj (2011). Pairwise-Interaction Games. ICALP: Automata, Languages and Programming, P. 159-170.
Shapley, L. S. (1953). A value for n-person games. Contributions to the theory of games II, ed. by H. W. Kuhn, A. W. Tucker. Princeton. P. 307-317.
Bulgakova, M. A. and L. A. Petrosyan (2015). Cooperative network games with pairwise interactions. Mathematical game theory and applications, 4(7), 7-18.
Bulgakova, M. A. (2019a) Solutions of network games with pairwise interactions. Vestnik of Saint-Petersburg State university. Series 10. Applied mathematics. Informatics. Control processes, 15(1), 147-156.
Bulgakova, M. A. (2019b). About one multistage non-zero sum game on the network. Vestnik of Saint-Petersburg State university. Series 10. Applied mathematics. Informatics. Control processes, 15(4), 603-615.
Petrosyan, L. A. and N.N. Danilov (1979). Stability of solutions of non-zero-sum, game with transferable payoffs. Vestnik of the Leningrad University. Series 1. Mathematics. Mechanicks. Astronomy, 1, 52-59.