Научная статья на тему 'GAME THEORETIC MODELS OF SUSTAINABLE MANAGEMENT IN MARKETING NETWORKS'

GAME THEORETIC MODELS OF SUSTAINABLE MANAGEMENT IN MARKETING NETWORKS Текст научной статьи по специальности «Математика»

CC BY
3
2
i Надоели баннеры? Вы всегда можете отключить рекламу.
Область наук
Ключевые слова
DIFFERENCE STACKELBERG GAMES / DIFFERENTIAL STACKELBERG GAMES / MARKETING NETWORKS / SUSTAINABLE MANAGEMENT

Аннотация научной статьи по математике, автор научной работы — Agieva Movlatkhan T., Korolev Alexei V., Ougolnitsky Guennady A.

Difference and differential Stackelberg games of opinion control on marketing networks are considered. The principal allocates financial resources to the firms for marketing purposes. It is supposed that the structure of a target audience described by a weighted directed graph is already determined in the stage of network analysis, and marketing control actions are applied only to the members of strong subgroups (opinion leaders). Conditions of homeostasis (phase constraints) which reflect the requirements of sustainable management are introduced additionally. The Stackelberg equilibria are found analytically. It is shown that the interests of the principal and the firms are completely compatible.

i Надоели баннеры? Вы всегда можете отключить рекламу.
iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.
i Надоели баннеры? Вы всегда можете отключить рекламу.

Текст научной работы на тему «GAME THEORETIC MODELS OF SUSTAINABLE MANAGEMENT IN MARKETING NETWORKS»

 Game Theoretic Models of Sustainable Management in Marketing Networks *

Movlatkhan T. Agieva1, Alexei V. Korolev2 and Guennady A.

Ougolnitsky3

1 Ingush State University Magistralnaya St. 39, Nazran, 386132, Russia E-mail: agieva25@mail.ru

2 National Research, University Higher School of Economics at St. Petersburg Khantemirovskaya St. ЗА, St. Petersburg, Russia E-mail: danitschi@gmail.com

3 Southern Federal University, J.I. Vorovich Institute of Mathematics, Mechanics and Computer Sciences Milchakov St. 8a, Rostov-on-Don, 344090, Russia E-mail: gaugolnickiy@sfedu.ru

Abstract Difference and differential Stackelberg games of opinion control on marketing networks are considered. The principal allocates financial resources to the firms for marketing purposes. It is supposed that the structure of a target audience described by a weighted directed graph is already determined in the stage of network analysis, and marketing control actions are applied only to the members of strong subgroups (opinion leaders). Conditions of homeostasis (phase constraints) which reflect the requirements of sustainable management are introduced additionally. The Stackelberg equilibria are found analytically. It is shown that the interests of the principal and the firms are completely compatible.

Keywords: difference Stackelberg games, differential Stackelberg games, marketing networks, sustainable management.

1. Introduction

The basic model of influence in a social network is a weighted directed graph where vertices represent the members of the network (basic agents), and arcs describe their mutual influence. Each vertex is ascribed a real value (an opinion of this agent) which is a function of time, and each arc is ascribed a constant real value (weight) which characterizes a degree of influence of an agent to another (or trust of the latter to the former). Besides, there are one or several influence agents (firms) which exert impact to the basic agents in their (influence agents) interests (Chkhartishvili et ah, 2019). An approach close to ours is used in (Sedakov and Zhen, 2019; Zhen, 2019).

This paper represents formulations and solutions of the game theoretic models of opinion control in social groups with a given network structure of the interactions. The models are interpreted in (not unique) marketing terms. It is supposed that there is a coordinating body which determines the firms’ marketing budgets. Thus, the Stackelberg games of the type "one principal - several agents" are considered. It is also supposed that in the stage of network analysis the target audience is already segmented into strong subgroups (opinion leaders) and satellites

* The work was supported by the Russian Science Foundation, project N 17-19-01038.

(Agieva et al., 2019). Then the control actions are restricted by the set of members of the strong subgroups that reduces the marketing expenditures essentially.

The objective of the control impact is maximization of the sum of opinions of all agents at the whole period from t=l till t=T. The impact in closed-loop strategies is exerted to the current opinions of the members of strong subgroups. Accordingly to the requirements of sustainable management, the state vector of the controlled dynamical system must belong to a certain domain of the space of states (homeostasis conditions) (Ougolnitsky, 2011).

The dynamics of the controlled system (opinions of the agents) is described by difference of differential equations. Respectively, the payoff functionals of the principal and the agents are or sum, or integral of their current payoff functions. An introduction of the discrete or continuous time are complementary approaches which permit to reflect different effects in real systems, and it is rational to use both types of the models. Thus, both difference and differential Stackelberg games with several agents are studied. The best response of the agents is defined as a Nash equilibrium in their game in strategic form. The difference models are analyzed by induction on the number of game periods, and the differential models are studied bymeans of the Hamilton-Jacobi-Bellman equations. For the consideration of budget constraints the Lagrange multipliers method is used.

The key problem in hierarchical systems is the coordination of interests of different control levels. The most widespread formulation of this problem is a comparison of the socially optimal outcome of the game with outcomes of the egoistic behavior of the players (the problem of inefficiency of equilibria) (Nisan et ah, 2007). In this paper the index of system compatibility is used for the characteristic of the coordination of interests. It is a fraction in which the numerator is a value of the principal’s payoff in the worst Nash equilibrium in the game of the agents, and the denominator is the value of her globally maximal payoff (Sukhinov et ah, 2020).

For the solution of the problems with the homeostasis condition we use the following approach. First, we solve a basic optimization problem without this condition and find the optimal strategies of the players. In the basic model both principal and agents maximize the summary opinions of the target audience, and the principal determines the marketing budgets of the agents. Given the optimal strategies and the homeostasis condition (the sum of the state variables should not exceed a given value x*), we suppose that the influence matrix A is stochastic (from the right). This assumption permits to abstract from the interaction of the basic agents. In this case the multiplication by the matrix A from the left does not change the sum of components of the state vector. Then the algorithm consists in optimal increasing of the sum of the state variables up to the level x*, and then to do nothing unless the period of consideration finishes.

However, this assumption is too strong. The standard assumption in the control problems on networks is that the matrix A is stochastic from the left. The matrix stochastic from the left has also a Frobenius eigenvalue equal to one, and a positive Frobenius eigenvector corresponding to this value. By means of the diagonal matrix Ξ with the components of this Frobenius eigenvector on the main diagonal, we can perform the conjunction operation and move from the initial influence matrix A to the similar matrix P which is stochastic from the right. Making a substitution of the state variables by the transform matrix Ξ, we reduce the initial problem with the influence matrix A to the problem with the stochastic matrix P in the new

coordinates. Solving this problem and making the inverse transform of the variables, we receive the solution of the initial problem.

The payment for this transform consists in a small change of the problem formulation. The initial homeostasis condition Jj=i. xj < x* takes the form Jj=i. zjxj < x*, where zj are components of the Frobenius eigenvector of the matrix A. Though these two formulations are not equivalent, they do not differ essentially. In fact, in the initial formulation we could take instead of x* the value x/Σj=1 zj, and the degree of the resulting values of the state variables remains the same. Besides, now we have to maximize not simply the sum Y^j=1 xj of the values of state variables but the projection of the demand vector to the Frobenius eigenvector of the matrix A. In other words, we maximize only the summary demand Y^j=1 zj xj along a general direction of the mutual influences of the agents as it is done with a standard good in the Leontief-Srafia model (Leontief, 1987; Srafla, 1960; Srafla, 1962).

We use this approach both in the discrete and continuous time. Respectively, the paper consists of four parts. In the Sections 2 and 3 we study difference Stackelberg game theoretic models of opinion control in which the principal should provide that a given limit value of the summary opinion not be exceeded. In the Section 2 the problem is solved for the stochastic influence matrix A, and in the Section 3 this strong assumption is substituted by a weaker standard assumption that the matrix A is stochastic from the left. The values of the system compatibility index are calculated. They are equal to one that witnesses about an ideal coordination of the modeled system. It appears that for the principal it is not advantageous to allocate to the agents more resources than they need from the point of view of their rational behavior. In the Sections 4 and 5 we analyze differential Stackelberg game theoretic models of opinion control with the homeostasis condition by the completely similar scheme. The Section 6 concludes.

2. Difference Stackelberg Game with a Constraint on the Sum of State Variables. Case of the Stochastic (from the Right) Matrix A

2.1. The Problem Statement

The model has the form

max,

J0 = ςδ Σxj-Σ'

t=0 j=1 i=1

m

- ° R t = 0, 2 ···,T, * = 1, 2, ...,m,

i= 1

n

x* < xj < R, t = 0,1, 2— 1 ,T,

j=1

Ji = ς s*

t=0

J2(xj— sj j

j=1

^ max,

'^2uj’t < rl, uf > °, t = 0, 1, 2, •••,T, * = 1, 2,...,m,

j=1

t

r

i

(1)

(2)

(3)

(4)

(5)

_ n

m / i,t , t

j = Σ j !uj +2^ aij xi, x0 = xjo, j = 1,2,.. .,n, (6)

i=1 1=1

o' Ö

i = f 1, if Λ II (7)

sj \0, if •f» T-yt* ‘<"5

Here n - a number of basic agents (a number of target audience), m - a number of control agents (competing firms), R - a total marketing budget of the leader, T -a length of the game, Jo, Ji - the payoff functionals of the leader and the followers (control agents) respectively, rf - a marketing budget allocated to the i-th follower by the leader in the moment (discrete period) of time t, xj - an opinion of the j-th basic agent in the moment t, rt - expenditures of the i-th control agent for the marketing impact (advertizing and so on) to the j-th basic agent in the moment t, aij - a coefficient of influence of the i-th basic agent to the j-th basic agent, bj -a coefficient of influence of the i-th control agent to the j-th basic agent, δ denotes a discount factor, i.e. δ = e-p. As different firms can exert influence to different members of the strong subgroups, we simply assume that if the i-th firm (control agent) does not influence to the j-th basic agent then bj = 0. Denote by A a matrix of the coefficients of influence among basic agents, i.e. A = {aij}i=i,2,...,n,j=i,2,...,n> AT - a transposed matrix of influences, Xt - a column vector of the values of state variables (opinions) in the moment t, є - a row vector of the dimension n formed by units, I - the unit n X n-matrix.

Assume that matrix of influences of the basic agents A is stochastic (from the right), i.e.

n

^ ^ aij 1, i 1,2,...,n.

j=i

In this case an action of the matrix AT from the left on any vector does not change the sum of its components. Thus,

є(Ат )T X0 = є(Ат )T-1X0 = ··· = єAT X0 = єХ0,

Aj = a2 = ··· = aT = 1 j = 1 2, ...,n,

where є is a row vector of n units.

2.2. Solving the i-th Firm’s Problem

Consider the problem of the i-th firm (4) - (7). In the one-period (without consideration of the zero period) game each i-th firm solves the problem

n n n

Ex° -Σ sj uj’0 + δΣ χj’1 =

j=i j=i j=i

n n n m _ n n

\ л 0 \ л i i,0

= L·, xi sj uj + δΣΣ^ν uj’0+δΣΣ^-x

j=i j=i j=ii=i j=il=i

with constraint n

i,^ 0

Eu/ ^ ri.

j=i

0

1

г,0

Uj

max

j =1,2 ,...,n

(8)

Optimizing by the Lagrange multipliers method, we receive the relations:

bi

ji

\ İ, 0 \ uj2

for any agents ji and j2 impacted by the i-th firm, therefore

uj’0 = uj( ЗА .

j2 j1 UJ

Denote J2jj=1 uj’0 by R0, then we have:

n

j = l

i’0

■E(b;

R0

2

2

and, therefore,

The substitution of (9) into (8) gives

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

(9)

Σ*° - R0 + δΣ

j=1 i=1

\

β°Σ j2+δΣΣαο·x0 io-

j=1 j=1 1=1 “j ’ j=

max.

A non-conditional optimization by R0 implies

(r0

δ2

max 4

τΣ (j ■

j=1

Thus, the optimal strategy of the i-th firm is

,2

<bj

δ2

τΣ j 2.

2 0

r ■

Σ”=1 (j2 ι - j=1

Consider a two-period game. Each firm i solves the problem

Σ (x0 - sj uj’^ +

j=1

with constraint

δΣ

Xj

j - sj uf) + δ2^^

j=1 ' j= 1 “j ’ j=1·2.·

max

Σί,0 , 0

uj ^ r<>

j=1

where uj’1, j = 1, 2,,n, is the solution of the one-period game. We have

(10)

(11)

1 ^ ^ 7 i / i, 0 і ^ ^ 0

xj =L·, bj V uj + 2^ ач x ’

i= 1

1=1

min

Σ*:

i=1

i, 1

3 4 u3

Σανx _Σ6

i,1

3 4 u3

Σai3 (Σ63 V3+Σ«pixr

1 = 1

i=1

1=1

p= 1

JV u3

n m

Σ^Σ bi’

/ i, 0 І

u,’ +

i=1 l=1 i=1

Substituting (12) and (13) into (11) we receive

(Ατ )2 X 0

J 3

(13)

Σ

3 = 1

n / m

0 i i,0

X - s3 uj

+ δΣ Σ3

3=1 \i=1

Σα3χ0 - s3 u3’11 +

l=1

n m n n m

+δΣΣ 63v3+δΈΣ ^Σ6

3=1 i=1 3=1 l=1 i=1 uj ’ j = i,2,...,n

or, denote by A3 the sum of the elements of the j-th row of the influence matrix A,

2 x^0

+ δ2ε (Ατ)2 X

max,

n / m

Σ

3=1

0 i i,0

x3 - s3 u3’

+δΣ Σ3

3=1 \i=1

y^aij x0 - s» u*’1 I +

3 3

l=1

+^ΣΣ б} Juf + δ2 ^ Σ A363 Juf + δ2ε (Ατ)2 X0

3=1i=1 3=1i=1

Maximizing (14) with constraint

„і, 0

uj ’ j = 1,2,. ..,n

max, (14)

Σί,^ 0

u3 ^ ri,

3=1

by the Lagrange multipliers method, we receive the relations

(1 + δΑ32 ) 632 _

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

(1 + δΑ3ι) 63i _\

32

31

0

0

for any agents j and j2 impacted by the i-th firm, therefore

i,0

i,0

f(1 + δΑ.

32 ) 63'2

I (1 + δΑ3ΐ ) 63

Denote EJ=1 u^ by R0, then receive:

Σu

i,0 _ 3 _

31

[(1 + δΑ3ι ) 63i]

■Σ[(1 + δΑ3) 63]2 _ R0

2

and, therefore,

i,0

3

[(1 + δΑ3) 63]2 R0

Σ”=1 [(1 + δΑ3) 63]2 ■

(15)

Substituting (15) into (14) and choosing only the terms with uj’ , we receive

/Я?

-R + «-

Ση 7 i

,·=ιΣ^ ju

j=i^i=v'j^vj v^i γ2Σj=iEi=i bj

\/ЕП=і [bj (1 + «Α)]2 V [bj (1 + «Α)]:

-R0 + «Ε

i=1

\

Я.°Е [bj (1 + «Aj)]2.

j=1

A non-conditional maximization of (16) by Я0 gives:

δ2

(R0) max = ΤΣ К' d + δΑ Я -

j=1

Thus, the optimal strategy of the i-th firm is

i’O

uj =

[bj (1 + «А-Я2 І δ2 ^ r. .2 0

j min ^ —Σ [bj (1 + «Aj^ ,r0

j=1

Σ”=ι [bj (1 + «Aj)]

Now consider the T-period game. Denote

Σδ

i / i’t

jV uj

= βt, t = 0,1, 2,...T.

j=1

Then we have

X1 = Ατ X 0 + β0,

X2 = (Ατ )2 + Ατ β0 + β1,

and iterations by t give

Xt = (Ατ )t X0 + (Ατ )t-l00 + (Ατ ^-2β1 + ··· + Ατ β^2 + β^1. Each i-th firm solves the problem

(16)

(17)

(18)

T

E*E

t=0 j=1

_t i i’t

xj - sj uj

t=0

«t hx t-E

i i’t

sj uj

i, 0 ^ ’ 3 ’

max,

(19)

with constraint

n

j=1

where uj’1, uj’2,..., uj’T -1(j = 1, 2,..., n) are the respective solutions of the (T — 1)-period, (T — 2)-period, one-period problems.

Choosing in (19) only the terms whith variables uj’0 j = 1, 2,..., n, we receive the expression

n

— E sj uj’0 + ε [«I + «2Ατ + ··· + «T (Ατ )T-1] β0.

j=1

Maximizing (20) with the constraint

Σ

j=1

i,0 - 0

uj < r.

(21)

by the Lagrange multipliers method, we receive the relations

i, 0 uj2

(1 + SAj2 +---+ δτ 1Aj2 1)bj2

(1+ SAj, + ... + ST—.j->)j j

for any agents j1 and j2 influenced by the Ath firm, where Aj denotes the sum of elements of the j-th row of the t-th power of the influence matrix, t — 1, 2,,T — 1. Denote as earlier the sum J2jj=1 uj’0 by Д°, we receive

uj

bj ΣΤ=-01 StAj

R0

* Aj

2 ·

j lbj^

A substitution of (22) into (20) with consideration of (17) gives

_b; ς« s‘a‘ -M

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

j= 1 i=1 j=1 t=0

Σί i’0 .

Σ uj +

j j

m n T — 1

ΣΣδ4ΣSt A

i=1 j=1 t=0 JJ2

j=1

bjE

T — 1 St At

t=0 δ Aj

(22)

2

—R + sE

i= 1

\

T — 1

RT Ρ5Σδ' A

j=1 \ t=0 у

Maximizing (23) by R0 without restrictions, we receive

VRF

Therefore,

δ2

T — 1

(Ri)max — 4 Σ ( bj Σ ^Aj

j=1 V t=0 у

Thus, with consideration of the constraint (21),

2

(bSET.—01 S-A·)2

δ2

T — 1

ΤΣ ης s‘Ad ,r.°

ЕП-1 (b;ET—01 S'A') i-i-

Respectively, in the period t (t — 1, 2 ,...,T — 1) we have

t=0

bjE^t SSAS

δ2

T — 1 — t

Σ"., (b’ET—0—‘ SMŞ

τΣ b5 Σ S'Ai .r,,

j=1

s=0

(23)

(24)

2

2

2

min

2

2

u

min

2

where r| are the resources allocated to the i-th firm by the principal in the t-th period. As the principal in the period t never chooses a value rt greater than

^4 Σ”=1 (bj δ6' As j , the expression (24) may be substituted by

i,t

U =

bjE

T-1-t

SsAj)2 rt

Ej=i (bj ET-n1-t δ6 As

2 ·

(25)

■‘j=1'yjX-‘s=0--

Considering the stochasticity of the matrix A, the expression (25) takes the form

bj ET=o1-t δ6 AS

bjE

T-1-t s=0 (

(bj

Ej=1 (bj ET=o1-t δ6 As) Σ”=1 (bj ET=o1-t δή Σ”=1 (bj)2

(26)

r

and does not depend on the length of period T. Thus we have proved the statement. Proposition 1. The optimal strategy of the i-th firm in problem (4)-(7) is

j = л£а2

’ Σ n=1 (b5)2

2.3. Solving the Principal’s Problem

The principal cannot decrease the value Σ"=1 xj·, and оп1у increases it by allocation the resources between the firms. It follows from the problem formulation that x* < Σn=1 x0 < x\ value Σ"=1 xj cannot become less than x*. Then

the principal’s optimal strategy is evident. The principal solves the problem (1)—(3), (5)-(6).

It is clear that for the maximization of her objective function the principal must optimally increase the value Ση=1 xj only up to the value x*, after what she must cease the allocation of resources to the firms. Then the value Ση=1 xj remains equal to x* until the end of the period of consideration. For the implementation of this strategy the principal already in the instant of time t = 0 must determine the instant t = H, when the value Yfjj=1 xj* becomes greater than x* due to the respective strategy. At the instant t = H — 1 the value of allocated resources should be reduced so that starting from the instant t = H the value Σ"=1 xj be strictly equal tox*.

Consider the H-period game. The principal solves the problem

H i n rn \ H / m

Σδ' (Σ xj — Σ ri| =Σδ' (εχ ‘ — Σ·

t=0 \j=1 i=1

t=o

i= 1,2,...,'

max, (27)

with constraint

£r.0 < R,

where r1, r2,..., r* 1 (i =1, 2,..., m) are the respective solutions of the (H — 1)-period, (H — 2)-period, ..., one-period problems, and

(bi )2 rt

л i,t V j) i , -i d TT -і

u j — 2 , t — 1,2,. .., 1

j ЕП=1 (bj Г

according to (26). Using the denotation (17) and decomposition (18), choosing in (27) only the terms that include r°, and considering that Aj = 1, we receive the expression

~Σ Г + (δ + δ2 + ··· + δΗ) εβ0,

i= 1

where

βο= σ bj

* ■ j

Σ

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

r0(bj)2

= 1 WΣ?=1 (bj)% j=i

Substituting (29) into (28), we receive

-Σr0 + (δ+δ2 + ···+δΗ)ΣΣ

0/hi )2

i=1

j=i <-ч ; (ь;)

-Σ r.0+Σ

i= 1

\

H

2Σδ'·

j=1 t=1

Maximizing (30) with constraint

(28)

(29)

(30)

Er0 < R,

i= 1

by the Lagrange multipliers method, we receive the relations

ΣΣ (ь? )2 r0

2 — ^0

ς;=1 (bj1)2 r°

for any influence agents R and £2· Denote as earlier the sum ^=1 r0 by R0, we receive

Σ

r0 =

Σ™1Σ?=1 (ь))2 < Σ?=1 (j )2

R0

therefore

r0 =

2 ·

Σ*., (bj)2r0 ΣΓ.1 Σ”„1 (ь; )

Substituting the expression (31) into (30) we receive

(31)

H

-R0 + VR0^ δ'

t=1

?0

\

ΣΣ;2·

i=1j=1

Maximizing the expression (32) by R0 without constraints, we receive

,_δ^Σ'=1 Ση=1 (ь; )2

H-1

j xt

Σ*

t=0

II

2

and

δ2

H-1

= 7ΣΣ (4Σ δ'

i=1 ‘ = 1

t=0

2

Therefore, with consideration of the constraint R0 < R the principal’s optimal strategy is

r0 =

/H-1

)2 St

Σ"=1(b‘} 2 min і ΣΣ (’‘ОΊΣ δ4 ,R

ΣΓ=1Σ”=1 (b‘)2 I 4 hh \L·

(33)

Completely similar to (33) we receive for t = 1, 2,,H — 1:

r' =

Σ"·1 (b‘ (2

ς“,ς n=. (ь‘ )2

δ'2

■ min

ΣΣ(4·

i=1 ‘=1

/H-1-t

Σ δ’

V ’=0

,R

i = 1, 2,

(34)

Using the denotation (17), decomposition (18) and the formula (26), we can write the value ΣΣ xH în ^he H-period game, t = 0,1,2,..., H, where H is exactly the instant when the value Σ"=1 x‘ first becomes equal to x*, in the form

εΧH = εΧ0 + εβ0 + εβ1 + ··· + εβH-2 + εβH-1 =

Σχ°+Σ

‘=1 i=1

\

'Σ^

‘=1

Σ

i=1

\

Σ(’; у2+···+Σ

ъ ^ V ‘ ‘=1

i=1

\

„H-1

E(6‘ )2'

‘=1

Substituting (34) into (35), we receive

(35)

εΧH = Σ··

‘=1

\

ΣΣ(’;

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

i=1 ‘=1

,· )2 . I δ

min < -2

\

m n /H-1

, 2

ΣΣ(’$ ΠΣ δ4 +

i=1 ‘=1

+ ··· +

\

ΣΣ(’;

i=1 j=1

,2 . I δ

mm < 2

\

fH-1-t

ΣΣΦ )(Έ δη WR+ ···+

i=1 ‘ = 1 V ’=0 /

\

ΣΣδ

i=1 ‘=1

‘, min < -‘ I 2

\

ΣΣ(’ί )2 (1+δ) +

i=1 ‘=1

A

ΣΣ(’

δ

‘, min < -

‘ і 2

\

ΣΣ^; Λ'Μ.

i=1 ‘=1

So, t = H is the time instant when the value

i=1 ‘=1

Δ

\

ΣΣ(’:

i=1 ‘=1

δ

min < -

‘ і 2

\

ΣΣ(’; +

i=1 ‘=1

2

i

0

J

2

2

m n

ΣΣ j2

min

2N

ΣΣΜ)2 <1 + *) и/д

i=1 j= 1

+ ··· +

A

ΣΣ®

i=1 j=1

/H-1-t

min

2,ςς jί ς n ж

+···+

= 1 j=1

2

m n

ΣΣ(4>2

min

2 N

m n

ΣΣ(*5)2

H-1

Σδ>

s=0

first becomes greater or equal than x* — Xj=i. x0 ^ is possible for a t < T). In

this case denote

2 . I δ

min < -2

\

EZ(*j )2·^

i=1 j=1

It is clear that the summary amount of resources allocated by the principal between the firms in the instant H — 1 so that in the in stant H the sum of the values of state variables be strictly equal to x*, must be equal to

R

(Δ1)2

ΣΙΣΣ n=1 j2

(37)

We obtain the statement.

Proposition 2. The principal has in the problem (1) (3), (5) (6) the following optimal strategy. When 0 < t < H — 2

rİ =

ЕП.1 j2

δ2

ΣΪ1 Σ“,1 (bj) ‘

when t = H — 1

■ min

rH-1 =

m n /H-1-t \ 2

ΣΣ«·)2 Σ δ') ’R\-

i= 1 j=1 V s=0 /

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

ς;=! (b;(2 R.

Σ’ΣΣ n.1 (bj)

2

and when H < t < T

T =0, і = 1,2,...,m.

Corollary 1. The maximal guaranteed payoff of the principal is equal to

SH-1\ n /H-1

Σδ' Σx0+ (Σδ'

vs=0 ) j=1

4 s = 1

\

«°ΣΣ j2 — r0

i=1 j = 1

/H-1

+ ς δ^

, s=2

\

^ΣΣ j2—iR1 + ···+

i=1j=1

f H—1 N

Σ s'

Vs=t+1 у

\

^ΣΣ j2 - S‘R‘

i= 1 j=1

H-1

\

m n T

(bj)2 - δΗ-2RH-2 - SH-1R + χ*Σ δh

i=1j=1

h=H

where

δ2

m n /H—1

V 2

R0 = j ( Σ δΊ ' R'

i=1 j=1

, s=0

δ2

m n /H—2

.2

R1 =-fjEE(bj) (ΣδΊ .R

i=1j=1

δ2

m n /H—1 — t

,2

Rt = ■“”{ΤΣΣj Σ δΊ .R

i=1 j=1 V s=0 /

(38)

(39)

(40)

(41)

2

2

І г 2 m n

ύΈΈ (bj )2 (1+ δ)2, R, (42)

i=1 j=1 )

by Rt is denoted the sum Y2Jm=1 rt’ t = 0,1,2, ■ ■ ■, H — 2, and the value R is determined by the expression (37).

Proof. It can be seen from (34). □

2.4. System Compatability Index

Denote the set of collections of equilibrium strategies of firms provided that principal’s strategy is {r^m^ by NE({ri}”=1). Calculate the system compatibility index for the considered problem:

max

SCI =

{ri}m=

mm

{uj }m=i;:

71eNE({ri}m=1) Jo ({ri}^1, {uj}i= 1;j=1)

max{ri}£l max{uj,}-; - J0 ({ri})=1, {uj}”=;1;j= 1)

(43)

This index shows the measure of compatability of system. The numerator of (43) is the payoff of the principal in the case if the firms choose the most unfavorable for principal equilibrium strategies. The denumerator of (43) is the payoff of the principal in the case if the firms choose the most favorable for principal strategies.

Given the principal’s strategy the set of equilibrium strategies of each agent is one-element in our case, so the numerator of the expression (43) is equal to (38). For calculation of the denominator in the expression (43) let us assume that each agent maximizes not his payoff but the principal’s payoff given her strategy, i.e. the agent solves the problem

T n

J δ^^ xj ^ max,

t=0 j=1

with constraint (3) and (5) - (6).

If J2j=i xj > x* then the optimal strategy of each firm is evident: it does not invest to the advertisement. Moreover, it is impossible because in this case the principal does not allocate any financial resources to the firms, i.e. then uj’4 = 0.

Let xj < x*· Unlike the principal, any agent is unable to estimate a priori

the number of time periods H which is required for the sum of values of the state variables becomes equal to x*, but it is redundant because the optimal strategies of the firms do not depend on the game’s length.

Consider the H-period game. Denote as earlier

( m /—\ П

Σ^νΰπ = β4, t = 0,1, 2 ,...H - 1. (44)

\i=1 /j=1

Then

X1 = X0 + β0,

X2 = X0 + β0 + β1, and so on, the iterations by t give

X4 = X0 + β0 + β1 + ··· + β4—2 + β4—1. (45)

Each і-th firm solves the problem

T

Σ*Σ

t=0 j=1

T

V £UX4

max,

(46)

with constraint

n

j=1

(47)

where uj’1, uj·’2,..., v,j’H-1(j = 1, 2,..., n) are the respective solutions of the (H — 1)-period, (H — 2)-period, ..., one-period problems.

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

Choosing in (46) only the terms whith variables uj’0j = 1, 2,..., n, we receive the expression

(δ + δ2 + ··· + δΗ) εβ0

Η—1 n m

δΣδ'ΣΣ

t=0 j=1i=1

(48)

Maximizing (48) with the constraint (47) by the Lagrange multipliers method, we receive the relations

\

i’0

i’0

Uji

for any agents j and j2 by the і-th firm. Denote as earlier the sum

Yfjn=1 uj’0 by Д°, we receive

Substitution of (49) into (48) gives

ΜΣδ· Σ

. t=0

\

j=1

Maximizing (50) by R0 with constraint (47), we receive

(Ro

r0.

max

(50)

Thus we have

i,0

(ь; )2

Σ”=ι j2

Let us write the strategy of the i-th firm in the ff-period game. For t the optimal strategy of the i-th firm is

i,t (ь;)2 t

; Σ”=1 (b;)2 i

0,1,

M-1

If t = H then uj,H =0 j = 1,2,... ,n.

Then the principal’s problem is the same as the previous problem already solved in searching the numerator on (43), and the denominator of (43) is also expressed by the formula (38). Therefore, in this problem the players’ interests are ideally compatible, and SCI = 1.

Proposition 3. In the model defined in subsection 2.1.

SCI = 1.

Remark 1. A non-trivial situation arises. In the t-th period the firm should choose the advertisement cost equal to ^ T^j=1 (bj)2 ^ΣH—1-t j if this value does not

exceed ri, and choose rt if the value ^ Ση=ι. (bj)2 |1-t ^ is greater than r|.

But the firms do not know the index H, and therefore, the value δ^Ση=1 (bj)2 ^ΣH=01 t . However, they know that the principal never allocates

them more resources than ^ Ση=1 (bj)2 (Σ^—01-t j and respectively they al-

ways choose rf. The principal could use it and allocatd to all firms in the t-th

period more resources than ^ Σn=1 (bj)2 ^ΣfL-)1-t , and respectively com-

pel them to invest in the advertisement more than it is advantageous to them. However, it is not advantdgeous to the principal herself to allocate more than

^Σn=1 (bj)2 ^ΣH-)1-t ! Therefore, the principal always allocates the value

^Σn=1 (bj) (Σs=0 j if it does not exceed her marketing budget R, oth-

erwise she allocates the resources as it is optimal for herself. This remains valid until the instant H — 1 when the principal allocates a reduced amount of resources which provides in the instant H that the sum of values of the state variables is strictly equal to x*. Thus, the interests of the principal and the agents (firms) are completely compatible.

3. Difference Stackelberg Game with a Constraint on the Sum of State Variables. Case of the Stochastic from the Left Matrix A

Now take a standard assumption that the influence matrix A is stochastic from the left, i.e.

n

'У ^ aij 1, j 1, 2,... ,n,

j = 1

and other assumptions are the same as in the previous section. The model takes the form

Jo = ς *

t=o

!>j xj -Σ r

j=1 i=1

^ max,

(51)

ri > 0, Σ r\ < R, t = 0,1, 2 і

i=1

1, 2,...,m,

(52)

x* < Zj xj < x*, t = 0,1, 2, ...,T — 1,T, (53)

j = 1

Ji = Σ s*

t=o

n

i,t ^ t

z^uj < ri, uj,t > 0, t = 0,1, 2,..., T, i = 1,2, ...,m, (55)

j=1

m _ n

j = Σ j / it \ ^

JUj + aij x i, x j x j 0, j = 12,...,n, (56)

i=1 l=1

o' o'

sj = { 1, Λ II (57)

\0, -О -о

where Zj > 0 j = 1, 2, ...,n, are components of the positive (right) eigenvector which corresponds to the Frobenius eigenvalue 1 of the matrix A.

Let us reduce the standard situation to the already considered case. Let Z be an eigenvector corresponding to the eigenvalue 1 of the matrix A:

Σ

j=1

(zj xj —

г i,t \

j uj )

—l· max,

(54)

Z

/ Z1 \

Z2

VZn J

, Zj > ° j = 1,2,...,n.

It is evident that the diagonal matrix

/ Z1 0 ... 0 \

„ 0 Z2 ... 0

V 0 0 ... Zn J

is the matrix of transfer to the stochastic (from the right) matrix P:

P = Ξ-1ΑΞ.

Then

A = ΞΡ.

l.e.

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

AT

7-1

P T

and the problem (51)-(57) can be formulated in other coordinates. Introduce a family of matrices Бг:

Бг =

(Ь\ 0 ... 0 \

0 Ь\ ... 0

V 0 0 ... ЬП J

and vectors

/ I i~t

'Uг

i = 1, 2,..., m.

. it

\ V un /

Then the constraint (56) takes the matrix form

i.e.

Xt+1 = Σ Б VU^ + Ξ-1Ρτ ΞΧt

i=1

Yt+1 = Б iVUÂ + PT Yt

where

( x1 N (y\\

X * = x2 , Y * = У2 = ΞΧ *

\ХП/ UnJ

Бг = ΞБi =

or in the coordinate form:

Alb! 0' ... 0 \

0 z2b2 ... 0

\ 0 0 ... гпЬгп J

(Ъ\ 0 ... 0 \

0 b\ ... 0

V 0 0 ...bin)

t+1 btV uj,t + Σ Pj y*

i= 1 1=1

yj =

and the constraint (53) looks like

x* <Σ yj < x*.

The expression (54) takes the form

j, = ς δ'

'=0

Σ(^·-

j=1

i i,' \

sj U )

max,

and (51) takes the form

Jo = Σδ'

'=0

Σ^· — Σ1

j=1

i=l

—l· max.

Other expressions in the problem (51)-(57) do not change. Based on the solution of the problem from the Section 2, we find the optimal strategies of the firms:

i,'

U =

b, i r'

(zjbj)2 r'

Ej=i (bj) Σ·=ι (zjbj)

2 ’

i = 1, 2,..., m, t = 1, 2,..., T.

As for the principal, let in the moment t = H the value

,·\2 . I δ

\

ΣΣ(*>b;

i=1 j=1

jbj, min . j j 1 2

\

ΣΣ(ζ;j +

i=1 j=1

A

ΣΣ(ζ>b

i=1 j=1

j min 1 2

\

ΣΣ(ζ;bj)2 (1 + δ)

i=1 j=1

A

ΣΣ(ζb

i=1 j=1

0, min < -

V 1 2

\

/H-1-'

ΣΣ(ζ>j2 Σ δ'Mr

i=1 j=1

s=0

A

ΣΣ(ζ>b

i=1 j=1

j mln 1 2

\

/H-1

ΣΣ(ζ; bj) 2 Σ δΊ g/r

i=1 j=1 V s=0 /

for the first time becomes greater or equal than x* — ^ n=1 zjxj (if ^ is possible for some t < T). Denote

Δ

İ1 = x* — ^3j'j

j=1

z;x0 — I Δ —

\

ΣΣ(ζ; bj

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

i=1 j = 1

)2 . I δ

) min %

2

\

ΣΣ(ζ; j m/r

i=1j=1

we find the total amount of resources that the principal should allocate to the firms in the moment H — 1 so that in the moment H the sum

δ

2

δ

be strictly equal to x*:

n

Σ zj xj

j=1

(Δ1)2

Em γ-νη

i=1 2^ j=1

(zj bj)2

Thus, the principal’s optimal strategy is the following: when 0 < i < H — 2

Ej"=i M)2 Σ”=1 ЕП=1 (Zj bj )2

δ2

/H-1-t

min

ΣΣ(ζ>j МС δ' . R;

i=1 j=1

s=0

when i = H — 1

when H <t < T

„H-1

ς;=! jzjj)

ΣΙΣΣ n=1 (Zj bj)

2 R;

Г =0, * = 1,2,---,m.

The maximal guaranteed payoff of the principal is equal to /H-1\ n (H-1

(Σ δ*) Σ z,χ0 + (Σδ

,s=0 / j=1

/H-1

\

β0ΣΣ(;> j2 - R0+

i=1 j=1

Σδ*

\s=2

H-1

\

R1ΣΣ bj2 -δ^1 + ■·· +

i=1 j=1

Σ s·

Vs=t+1 у

\

№ΣΣ(*4 bj)2 - ‘tRt + ■■■+

i=1 j=1

+ δ

H-1

\

RH-^^I3 (zjbj)2 - δH-2RH-2 - £H-1 R + x* Σ

i=1 j = 1

where, as can be seen from (34),

2

h=H

R° = min<! δ~τΣΣ (z, bj) 2| Σ M · R·

(H-1

i=1j=1

δ2

/H-2

R1 = min \ ΤΣΣ (z,j ΙΣ δΊ ,R

i=1 j=1

δ2

/H-1-t

Rt = min 1 ΤΣΣ(Μ> Σ δΊ ' R’

i=1j=1

2

Іе2 ” n I

^ΣΣ (z, bj )2 (1+δ)2 >R>

i=1 j=1 J

where Rt denotes the sum Y/”=1 r\, i = 0,1,2,..., H - 2, and Ле value R is determined by the expression (58).

It is clear that SCI = 1 as in the previous problem from the Section 2. The expressions for the maximal guaranteed payoff in this case differ from those in the section 2 only by the multipliers z, - the components of Frobenius eigenvector of matrix A. So, in this problem with the homeostasis condition, as in the similar problem without this condition, the interests of players are completely compatible.

4. Differential Stackelberg Game with a Constraint on the Sum of State Variables. Case of the Stochastic (from the Right) Matrix A

4.1. The Problem Statement

Consider this problem in continuous time. Now the model takes the form

Jo = Г e-pt n m

J 0 53xj(t) ri(t) dt ^ max, (59)

j=1 i=1

m

ri(t) > ο, Ση(t) - R, tє [o,t], * = 1 2,.. ·,m, (60)

i=1

n

x* --- xj(t) --- x*, t Є [0,T], (61)

j=1

Ji = Г e-pt n

Jo 53(xj(t) - sjuj(t)) dt ^ max, (62)

_j=1

n

^2uj(t) --- ri(t), u j(t) > ° t є [0,T], * = 1 2, ···,m, (63)

j=1

m n

xj = bjyjuj(t) + Ş^ aijxi (t), xj (0) = xjO, j = 1, 2,.. ., n, (64)

i=1 l = 1

sj J 1, if bj > 0, (65)

\0, if bj = 0.

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

Here aij is a coefficient of influence of the ί-th basic agent to the j-th basic agent in the discrete model. When we move from the discrete description to the continuous one, the influence matrix A is substituted by the matrix A in the form

A = A-I.

Thus,

aij, if * = j,

aij C if * j·

Let us first assume that the matrix A is stochastic (from the right), i.e.

a

^ ) aij 1, * 1,2, · · ·, n·

j= 1

Solving the problem of the ί-th firm, we can substitute n state variables xj by their sum and denote by x the only state variable:

n n

^ ^ xj x^ xj0 x0,

j= 1 j= 1

then

ΣΣ alj xl ΣΣ alj xl Σχί = Σχι — Σ xj =x - x = °>

j=1 l = 1 j=1 l=1 j=1 l=1 j= 1

and the conditions (62) and (64) respectively take the form

ґ t

Ji

-pt

x — Σ sj uj(t)

j=1

dt —> max,

and

χ = ΣΣ bj\J ujx(°) = xo·

j=1i=1

4.2. Solution the Problem of i-th Firm

The Hamilton-Jacobi-Bellman equation has the form

(66)

(67)

dv I n dV. n m у__________

pV — st = miXn Ix(t) — Σuj (t) + ^ΣΣ ь5У“їм

■’ 1 j=1 j=1 fc=1

with constraint

Σ uj(t) - ri(t)·

j=1

Maximizing by uj, j = 1, 2,, n, bj = °, we receive

0Vi... 1

аЗ 5 —1 = μ,

where μ is a Lagrange multiplier. Then for any 1 — j1; j2 — n:

(68)

(dVi/dx)ljj2 j fuj2Y

(dVi/dx)bj bj I uj

Denote the sum

by Ri (t). Then

(b

ΣΣ (bj)

jJ j=1

Ri(t)(bj

J j (bj )2'

As we take the linear Bellman functions,

Vi (x, t) = ai(t)x + βi(t),

then we can write the equation (68) with consideration of (69) in the form

. . m ” bkJRkMbk

pai(t)x + p^(t) — a .(t)x — β l(t) = x — R(t) + al(t) —j ,

j=1\t j (j)2

(69)

2

or

pai(t)x + ρβ1 (t) — α l(t)x — β i(t)

x — Ri(t) + a1 (t) ^2

k=i

\

Rk(t)J2(bk)2· (70)

Equation of the coefficients at the variable x in the left and right hand sides of the equation (26), we receive a differential equation for ai(t). Its solution gives

ai(t) = Cept + 1 ·

P

Using the boundary condition ai (T) = 0, we find the integration constant

C = —1 epT ·

P

Therefore,

ai (t) = p (l — ep(t-T·

Function ai(t) is the same for all firms, then we will o mit the index i of the functions ai(t). Equating the constant terms in the left and right hand sides of the equation (26), we receive a differential equation for βi (t):

β i(t) — ρβ i(t)

Ri(t) — ai (t)J2

k=l

\

Rk ΜΣ (b

and solve it by the method of variation of parameters. We have:

ei(t) = eptC (t),

C (t)ept = Ri(t) — a(t)

k=1

\

Rk «)£ (j

3 = 1

C(t) = f e-pT

J 0

\

Rk (τ (b

j=1

Ri(T) — α(τ) Σ

k=1

Considerating the boundary condition β% (T) = 0, we receive

P t m

C = — / e-pT Ri(T) — α(τ)]T

άτ + C.

k=1

\

Rk(Tj2

3 = 1

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

άτ.

Therefore,

βi (t) = ept^" e-pT

κ(τ )Σ

k=1

\

Rk (τ )£ (j )2 — ft (τ)

3=1

άτ.

2

Choosing the maximal value of the right hand side of the expression (26) in dependence of R (t), we have

-Ri(t) + aj (t)y/Rffî

N

^ max,

hence

1 n

(Ri(t))max = 4(«(ί))2Σ (bj) 2 ·

j=1

Thus, the value Rj(t) (omitting the index г at aj(t)) is equal to

Ri(t)=min 14(α(ί))2έ (bj)2 ,ri(t)· (7i)

Thus we have proved the statement.

Proposition 4. The optimal strategies of firms in problem (66) (67) are defined by expression (69), where Ri(t) defined by (27) and

a(t) = 1 (l - ep(t-T>) ·

4.3. The Principal’s Problem

Now consider the principal’s strategy. The principal cannot decrease the value Y)jj=1 xj s^e can only increase it by allocating the resources to firms. Then her optimal strategy is evident. From the problem formulation it follows that J2jj=1 xj0 > x*. The principal solves the problem:

J0

e-Pt

J2xj(t) -Σ r'i(t)

dt ^ max,

with constraints (60)-(61), (63)-(65).

It is clear that for maximization of her payoff the principal should increase the value Ση=1 xj (t) as in the previous discrete problem but only up to the value x*, after what she must cease to allocate resources to the firms. Then the value Y)jj=1 xj (t) will remain equal to x* until the end of the game. For implementation of this strategy the principal should already at the instant t = 0 determine the instant t = h when the value Y)jj=1 xj (t) becomes equal to x* given the respective strategy.

As according to (27) the principal will never allocate to any firm г in any instant t more resources than

тИ‘))2Е <b5 )2.

j=1

we can rewrite the expression (69) in the form

ri(C (bj) Ej=, (bj2'

Substituting to the equation (67) for the sum of variables x the expression (72), we receive

χ = Σ

\

ri (ί)Σ (bj

x(0)

xo.

(73)

Integration the equation (73), we find

x(t)

N

ri(τ)^ (bj)2dr + xo.

j=i

(74)

Thus, h = x-1(x*) is the instant of time when the right hand side of the expression (74) becomes equal to x*.

Given the value of h and using the assumption about stochasticity of the influence matrix, we can reformulate the principal’s problem as follows:

Jo

e pt \ x(t)

Eri(t)

i= 1

dt ^ max,

with constraints

m

ri(t) > 0, (t) < R, t Є [0, h], i = 1, 2,. .., m

i= 1

(75)

and (73).

The Hamilton-Jacobi-Bellman equation takes the form:

pVo

dVo

dt

max < x(t)

і,1<i<m I

Eri(t) +

i= 1

dVo

dx

Σ

\

η(^Σ (bj

j = 1

2

with constrain

m

Eri(t) < R.

i=1

We take the linear Bellman function:

Vo (x, t) = ao(t)x + βo (t). Substituting (77) into (76), we receive

pao(t)x + ρβ°^) — α o(t)x — β o(t) =

(76)

(77)

= max

ri(t),1<i<m

=(t) — Σ ri(t) +ao (t)^

\

ri(t)YJ (j

2

(78)

Equating in the left and right hand sides of the equation (78) the coefficients at the variable ^ ^^^^rential equation for ao(t):

a o (t) — pao(t) = — 1.

It coincides with the differential equations for oi (t) but has other boundary condition

o0(h) = 0.

Therefore,

o0(t) = 1 (l - ep(t-h)) , t Є [0, h]. (79)

Equating the constant terms in the left and right hand sides of the equation (78), we receive a differential equation for β0(ί):

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

β0(t) - ιβ 0(t) = ς ri(t) - o0(t)E

\

i=1 i= 1

Its solution by the method of variation of parameters gives

e0(t) = C (t)ept,

ri(t)E (bj)2.

j=1

C (t)ePt = E ri(t) - o0 (t)E

\

ri(t)E j >

j= 1

C(t) = / IEri(T)-α0(τ)E

j0 \i=1 i=1

Using the boundary condition we find

\

ri (t) E (bj )2 ) e ρτ dT + C.

j= 1

β0 (h)=0,

ft j m m

C = - i IEri(τ)-α0(τ)E

'0

. i=1

= 1

\

ri(T)Σ (bj )21e ρτ dT.

j=1

Therefore,

C(t) = у I o

'(T )Σ

i= 1

\

ri(T) E (bj)2 - E ri(τ) 1e ρτdTj

j=1

i= 1

e0(t) = ept/

*0(τ) ( Ε

i=1

\

ri(T ^ (bj) 2 ) -Ε ri (τ )

j=1

= 1

e ρτ dT.

Particularly, when t = 0 we have

oh

β0(0)= /'

J0

0

(T) IE

= 1

\

ri(T^ (bj) I -E ri (t)

j = 1

e-pr dT. (80)

Maximizing the right hand side of the expression (78) by ri(t), i = l, 2,..., m, with constraint Σ”=1 ri(t) < Д, we receive

l - o0 (t)

yj (bj )2

2 л/ ri(t)

h

where μ is a Lagrange multiplier. Then

(ij· )2 = .

vjj2 NS)

Therefore, for any 1 < i,, i2 < m it is true that

ri2 (t)

ς;=, )j )2 ΣΣ, )2

■r-i· (t).

Denote the sum r.(t) bУ r(t). Then

Therefore,

ri· (t)EE (bj)2 =r(t)E (bj·)2 ·

i=1j=1 •=1

r.(t)= *)- j (

4 2 ·

ςς,ςε j2

Substituting (81) in the right hand side of (78) we receive:

x(t) — r(t) + a0 (t)

i=1 j=1

A non-conditional optimization of this expression by r(t) gives

~(t)EE ·2·

(r(t))r

4 (a0 (t))2 EE(bj )2

.=1 j=1

Thus, for the instants t Є [0, h] when

R > (a0«)) ΣΣ j '

i=1 j=1

the principal’s optimal strategy is

r.(t) = - (a0(t))2Σ (bj)2 , i = 1, 2, · · ·,

j=1

For the instants t Є [0, h] when

R< - (a0 (it ΈΕ (bj) 2 >

i=1 j=1

the principal’s optimal strategy is

r,W = Σ?-·( bj )2 2

ς ς,ς n=i )bj )2

Д i = 1, 2,···

^j=1 V j'

Combining the two expressions for r.(t) in the one formula, we receive statement.

(81)

following

Proposition 5. The principal’s optimal control is

\ 2

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

n{t) =

Ση=ι m2

min I 4 (a0(t)) 2^Σ (b*)2 >

m n

2 /, k \ 2

where

ΣΓ=ιΣ?=ι (bk)2 r *=i j=i

r i (t) = 0, t Є [h, T ], i = 1,2,

1

α0 (t) = p (l - ep(t-h>) . Using (79) and (80), we can write:

max J0 = V0(x(0), 0) = α0(0)x(0) + β0(0)

rijl^i^m

t Є [0, h], (82)

p (1 - e-ph\ X0 +f P J 0

V) ( Σ

. i=i

\

ri(τ~)Σ(bj)21 - Ση(τ)

j=1

i= 1

e pT άτ, (83)

when the value ri (τ) is determined % the expression (82), and α0(τ) - by the expression (79). Then according to (83) the principal’s guaranteed payoff is equal to

rT і і

-ph\

max

ri,1<i<m

J0 + x* / e-ptdt = 1 (e-ph - e-pT\ x* + 1 (1 - e-ph\ X0+ Jh P P

I m

0 (τ) |Σ

\

ri(r~)Σ (bj) I -Σri(τ)

j=i

-pT άτ,

(84)

where the instant h is determined by the expression (74).

4.4. System Compatibility Index

Calculate the system compatibility index:

max{ri}m min j n ({ }„) J0 ({ri}”=i, {uj }”L;i;jn l)

SCI = --------------- ,>,= 1ij = 1-(l ίη1 Xm Si :rn; n \--------· «5

max{ri}m=1 max{„j}m!1.j=n1 J0 ({ri}I=i, {uj}i=;i;j=i)

As soon, SCI shows the measure of compatability of system. The numerator of (85) is the payoff of the principal in the case if the firms choose the most unfavorable for principal equilibrium strategies. The denumerator of (85) is the payoff of the principal in the case if the firms choose the most favorable for principal strategies.

Given the principal’s strategy the set of equilibrium strategies of each agent is one-element in our case, so the numerator of the expression (85) is equal to the right hand side of (84). For calculation of the denominator in the expression (85) let us assume that each agent maximizes not his payoff but the principal’s payoff given her strategy, i.e. the agent solves the problem

Ji

0

e pt Xj (t)dt

j=i

^ max,

with constraints (61) and (63) - (65).

If J2j=i xj > x* then the optimal strategy of each firm is evident: it does not invest to the advertisement. Moreover, it is impossible because in this case the principal does not allocate any financial resource to the firms, i.e. then uj^ = 0.

Let Yjj=1 xj < x*· Unlike the principal, any agent is unable to estimate a priori in which instant of time h the sum of values of the state variables becomes equal to x*, but it is redundant because the optimal strategies of the firms do not depend on the game’s length.

Consider the game in the time segment [0, h]. Similar to the Section 3 we receive for the optimal strategies of all firms the expressions similar to (69), (27), (79):

ujC) = ,R,(t) = min 1 1 a(t)Şj (bjf ,α(ί) = 1 il - ep(t h)

j = 1

As the principal will never allocate to any firm more resources than

l

—a

4

П

«Σ (Ο2·

we can write

uj(t)

(bj)2 a(t) Σ?=ι (bj)2 ’

i = l, 2, j = l, 2, t Є [0, h].

Then the principal’s problem is the same as already solved, and the denominator in (85) is also expressed by the formula (84). Therefore, in the problem with constraint on the sum of state variables the interests of players are completely compatible, i.e. SCI = l.

Proposition 6. In the problem defined in subsection 4-1.

SCI = l.

Remark 2. A non-trivial situation arises agarn. At the instant t the firms should choose the advertisement cost equal to (l/4p) (l — ep(t-h^Yjjj=1 (bj) if this value does not exceed r|, and choose rj, otherwise. But the firms do not know the instant h, and Ле value (l/4p) (l — ep(t-h^^2jj=1 (bj) , respectively. However, they know that the principi wil( never allocate to them more resources than

(l/4p) (l — ep(t-h)) J2j=1 (bj) , and they always choose r|. The principal could use it and allocate to al( )rms in the moment t more resources than

(l/4p) (l — ep(t-h^) Yf^n=1 (bj) > and compel them to invest more. However, it is not

advantageous foe the principal herself to allocate more than

(l/4p) (l — ep(t-h^Yffj=1 (bj) ■ Therefore, she always allocates

(l/4p) (l — ep(t-h)) £”= (bj)2, if this value does not exceed her marketing budget R, otherwise she allocates the resources in the way optimal to her. This strategy remains valid until the instant h, after what the principal does not allocate more resources because the value x* is achieved optimally. Thus, in the problem with constraint on the sum of state variables the interests of the principal and the firms are completely coordinated as in the problem without the homeostasis condition.

5. Differential Stackelberg Game with a Constraint on the Sum of State Variables. Case of the Stochastic from the Left Matrix A

Now take a standard assumption that the influence matrix A is stochastic from the left, i.e.

n

'У ^ aij j

j=1

Let zj > 0, j = 1, 2,..., n are components of the positive (right) eigenvector corresponding to the Frobenins eigenvalue 1 of the matrix A.

The model takes the form

Jo

■T

e-pt

t=0

J2zj xj(t) ri(t)

j=1 i=1

^ max,

(86)

n(t) > 0, 5>(t) - R, t Є [0,T], i = 1, 2,..., m,

i=1

x* — Zj Xj (t) — x*, t Є [0,T ],

j=1

Ji = Σ δ

t=o

(zjxj(t) - sjuj(t)

j=1

—> max,

(87)

(88)

1, if bj > 0, 0, if bj =0,

j=1

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

uj(t) — ri(t), uj(t) > 0, t Є [0,T], i = 1, 2,

x j(t) = 53 bj\/ uj(t) + Σ a,-jxi (t), xj(0) = xj0, j = 1, ft-.,n. (89)

i=1 l = 1

Here aij ft а coefficient of influence of the i-th basic agent to the j-th basic agent in the discrete model. As earlier, when we move from the discrete description to the continuous one, the influence matrix A is substituted by the matrix A in the form

A = A — I.

Therefore,

Let

aij

aij, if i = ft

aij ft if i j.

Z

/ Z1 \

Z2

, zj > 0, j

VZn /

1, ft.. .,n

be the Frobenius eigenvector of the matrix A, corresponding to the eigenvalue 1 of the matrix A. The diagonal matrix

(zi 0 ... 0 \

0 Z2 ... 0

V 0 0 ... zn J

is the matrix of transfer to the stochastic (from the right) matrix P:

P = Ξ-1ΑΞ.

Then A = ΞΡΞ-1,

i.e. Ατ = Ξ-1Ρτ Ξ

We can reformulate the problem (86)-(89) other coordinates. Introduce a family of matrices Бг:

/b\ 0 ... 0 \

0 b2 ... 0

V 0 0 ... bn /

and a family of vector functions

νϋψ)

( Vu\(t)\ уЯ2Т

\Vun(t)J

i = 1,2,. .., m.

Then the constraint (89) takes the matrix form

X(t) =Σ Бгу/ЩЇ) + (Ξ-1Pτ Ξ - I)X (t), г=1

where

m

Y(t) = ΣБгу/ЦЩ + (Pτ -1)Y(t),

г=1

( X1 (t) ( V1(t) \

X (t) = X2 (t) , Y(t) = У2 (t) = ΞΧ (t),

\xn(t)j \yn(t)j

1 X1(t) \ ( y1(t) \

X(t) = X2 (t) , Y(t) = У2 (t) = ΞΧ (t),

\Xn(t)j \yn(t)j

/zibi 0 . . 0 N /bi 0 . . 0 \

Bi --- ΞΒ --- 0 Z2b2 . . 0 0 b* . . 0

^ 0 0 . . znbn / V 0 0 . b )

or in coordinates:

m____________ n

ўі (t) = Σ4\Ι u(t) + Σ pij yi(t)·

i=1 l=1

The constraint (87) takes the form

x* — ^)yj(t) — x*, t Є [0, T].

j=1

The expression (86) is written as

Jo = / e Jt=o

-pt

X)yj(t) -X) ri(t)

j=i

i=1

dt ^ max,

and the expression (88) as

Ji — I e Jt=0

-pt

X)(yj(t) - sjuj(t)

j=1

dt —> max .

Other expressions in the formulation of the problem (86)-(89) do not change. Based on the previous solution, we find the optimal strategies of all firms:

. (bj) ri(t) (z,·bj2 ri(t)

uj (t)= J ^Л2 — Д n j ( i (.) , i — 1, 2,...,m, t Є [0,T],

Σ”=ι bj

Ση=ι (Zjbj)2

and the principal:

ri(t)

where

Ση=ι (Zjbj)2

min <{ 4 («0 (t))2 ΣΣ (zj j )2’ t Є [0, h]

ΣΓ=ιΣ”=i (zjbk) r fc=!j=!

ri(t), t Є (h, T], i — 1, 2, .. ., m 1

(t) = P(l - ep(t-h)) , t Є [0,h].

The principal’s maximal guaranteed result is equal to

1 (e-ph - e-pT) x* + 1 (1 - e-ph) xo+ ρ ρ

+

o

α0(τ)

ri(τ )Υ^ (zj bj)2

j=ı

J2ri(r)

i= 1

e ρτ dr,

where the instant of time h solves the equation

i.e. h is the instant t when the right hand side of (90 becomes equal to x*.

As soon, SCI = 1 as in the previous problem from the Section 4. The expressions for the maximal guaranteed payoff in this case differ from those in the section 4 only by the multipliers zj - the components of Frobenius eigenvector of matrix A. So, the interests of players are completely compatible as in the problem without the conditions of homeostasis.

6. Conclusion

We studied difference and differential Stackelberg game theoretic models of opinion control in marketing networks in which the principal should provide that a given limit value of the summary opinion not be exceeded. First, the problem is solved for the stochastic influence matrix A, and then this strong assumption is substituted by a weaker standard assumption that the matrix A is stochastic from the left. It is possible to introduce a weaker assumption, namely that a non-negative influence matrix has a strictly positive Frobenius eigenvector. In this case, it is only required to introduce in the received formulas the Frobenius eigenvalue of the influence matrix as a multiplier. It appears that for the principal it is not advantageous to allocate to the forms more resources than they need in their rational behavior. In all cases in frame of the considered model the interests of players are completely compatible.

Other formulations of the homeostasis conditions are possible, for example, x* < J2jj=i xj < x*· But *n the considered model, when all state variables can only increase their values or remain constant, the constraints from below are satisfied automatically.

Another formulation can use the constraints only for the terminal values of the variables, i.e. x* <Σj=1 xj < x*, where xj are the values of the state variables in the end of the game. However, the satisfaction of these conditions in any instant along the game implies their satisfaction in the final instant also. The inverse implication is also true because the values of the state variables do not decrease during the game.

At last, the constraints can bound each state variable separately: x*j < xj < x*j or x*j < xj < x*j. In such formulation it is necessary to evaluate in which instant of time which variable reaches its boundary value. Starting from these instants, the control impact must be exerted only to the remaining state variable which decreases the problem dimension.

References

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

Agieva, M.T., Korolev, A.V., Ougolnitsky, G. A. (2019). Modeling and Simulation of Impact and Control in Social Networks. In: Modeling and Simulation of Social-Behavioral Phenomena in Creative Societies. First International EURO Mini Conference, MSBC 2019 Vilnius, Lithuania, September 18-20, 2019 Proceedings. Springer, 2019. P. 29-40. Chkhartishvili, A., Gubanov, D., Novikov, D. (2019). Social Networks: Models of Information Influence, Control, and Confrontation. Springer Publishers.

Leontief, W. (1987). Input-Output Analysis. The New Palgrave. A Dictionary of Economics, edited by J. Eatwell, M. Milgate, and P. Newman, vol. 2, pp. 860-864. London. Macmillan.

Nisan, N., Roughgarden, T., Tardos, E., Vazirani, V. (2007). Algorithmic Game Theory. Camdridge University Press.

Ougolnitsky, G. (2011). Sustainable Management. N.Y.: Nova Science Publishers.

Sedakov, A., Zhen, M. (2019). Opinion Dynamics Game in Social Network with Two Influence Nodes. Vestnik SPb Gos. Univ. Ser. Appl. Math. Inform. Sci. Contr. Proc., 15(1), 118-125.

Sraffa, P. (1960). Production of Commodities by Means of Commodities. Prelude to a Critique of Economic Theory. Cambridge: Cambridge University Pres.

Sraffa, P. (1962). Production of Commodities by Means of Commodities: A Comment. Economic Journal, 72, 477-479.

Sukhinov, A. I., Ougolnitsky, G. A., Usov, A. B. (2020). Methods of Solving the Theoretic Game Models for Coordinating Interests in Regulating the Fishery Industry. Mathematical Models and Computer Simulations, 12(2), 176-184.

Zhen, M. (2019). Stackelberg Equilibrium in Opinion Dynamics Game in Social Network with Two Influence Nodes. Contributions to Game Theory and Management. Vol. XII. Collected Papers of the Twelfth International Conference "Game Theory and Management", Petrosyan, L., and Zenkevich, N., Eds., St. Petersburg: Saint Petersburg State University, 366-386.

i Надоели баннеры? Вы всегда можете отключить рекламу.