УДК 519.213
Set Functions and Probability Distributions of a Finite Random Sets
Natalia A. Lukyanova* Daria V. Semenova^
Institute of Mathematics and Computer Science Siberian Federal University Svobodny, 79, Krasnoyarsk, 660041
Krasnoyarsk State Medical University Partisan Zheleznyak, 1, Krasnoyarsk, 660022
Russia
Elena Е. Goldenok*
Institute of Economics and Commerce Siberian Federal University Prushinskaya, 2, Krasnoyarsk, 660075
Krasnoyarsk State Medical University Partisan Zheleznyak, 1, Krasnoyarsk, 660022
Russia
Received 08.11.2016, received in revised form 10.01.2017, accepted 08.04.2017 This paper is the investigation of the probability distributions of a finite random set in which the set of random events are considered as a support of the finite random set. These probability distributions can be defined by six equivalent ways (distributions of the I-st — VI-th type). Each of these types of the probability distributions is the set function defined on the corresponding system of events. In this paper the sufficient conditions are formulated and proved. When these conditions are satisfied, then the set function determines the probability distributions of the finite random set of the II-nd and the V-th type. The found conditions supplement the known necessary conditions for the existence of the probability distributions of a finite random set of the II-nd and the V-th type.
Keywords: finite random set, set function, probability distribution. DOI: 10.17516/1997-1397-2017-10-3-362-371.
Introduction
The problems of modeling of random objects of non-numeric nature are actual in multivariate data analysis. Such problems arise in medicine, for example, during forecasting the possible complications in patient according to the set of diagnostic symptoms; in Economics — for a research of the consumer demand; in fire safety — for the description of process of accidental spread of a forest fire etc. One possible approach to solving these problems is to use the apparatus of random sets.
The random element with values from set of all subsets of a particular finite set M is called a finite random set. The set M is called a support of a finite random set. In role of the support can be a finite set of objects of numerical or non-numerical nature. Finite random sets with different
* nata00sfu@gmail.com tdariasdv@gmail.com ^ egoldenok@gmail.com © Siberian Federal University. All rights reserved
types of supports were studied in works by I. Molchanov [1], H. T. Nguyen [2], O. Yu. Vorobyev [3], A. I. Orlov [4]. In this paper the finite random sets with support in the form of a set of random events are considered. Finite random sets with such support can act as a mathematical model of complex objects and systems that are described by a finite number of attributes. The appearance of any of these attributes is represented as a random event.
Procedures for constructing finite random sets with given distribution laws are demanded at random-set modeling of complex objects and systems. About twenty probability distributions of finite random sets are already known and used in practice. However, the extension of application area of random-set modeling requires the finding out new distributions. One of approaches for constructing new probability distributions of finite random sets is offered in works [5] - [7], [8]. This approach is based on the construction of a set functions and identify the conditions where these functions are probability distributions of the finite random set.
It is known that probability distribution of a finite random set can be defined by six equivalent ways (distributions of the I-st-VI-th type). Each of these types of the probability distributions of a finite random set is the set-function defined on the corresponding system of events. In this paper the sufficient conditions are formulated and proved. When these conditions are satisfied, then the set- functions determines the probability distributions of the finite random set of the II-nd and the V-th type. The found conditions supplement the known necessary conditions for the existence of the probability distributions of a finite random set of the II-nd and the V-th type that were proven in the works [3, 9].
1. Finite random set
The concept of the random element is used for description of finite random set [10].
Definition 1. Let (q, F, P) be the probability space, and (U, A) be the measurable space, where U is the arbitrary set, and A is some a-algebra of its subsets. We'll say that the function K = K (w) defined on Q and taking values in U is the F/A-measurable function or random element (with values in U), if {w : K(w) £ A} £ F is true for any A € A.
If we consider some space (U, A) elements of which there are sets, then the measurable mapping K of family of elementary events of arbitrary probability space (Q, F, P) in (U, A) is called a random set [2, 4]. Let M be the finite set and let (U, A) = (2M, 22^. Then the
measurable function K : Q ^ 2m mapping the probability space into the space of subsets of a finite set M, is called the finite random set and the set M is called a support of K.
This paper investigates the specific random sets. The finite sets of random events X C F (|X| < to), selected from the algebra F of the probability space (Q, F, P), are supports of these random sets.
Definition 2. The finite random set K defined on the support X C F represents a mapping K : Q ^ 2x measurable with respect to a pair of algebras ^F, 22 j that in the sense K_1(A) € F
is true for every A € 22 .
The formula K (w) = {x € X : w € x} can be interpreted as a "random set of the occured events" since for every elementary event of the experiment w € Q is corresponded a subset of the events X C X containing all events which occurred in this experiment.
The {w : K(w) = X, X C X} € F, is a measurable event. This means the finite random set K gets one of its possible values from X C X in other words the occured random events from X
form the subset X C X and the no-occured random events form the subset Xc = X \ X : {w : K (w) = X, X C X} = ( H ^W P| xA C q.
\x£X J \x£Xc J
For simplicity, instead of {w : K(w) = X, X C X} will write {KX = X}, where the subscript X at K means that the finite random set K defined on the support X. The set of all these events generated by the set of the events X forms a partition of the space of elementary events
Q= E {Kx = X}.
XCX
2. Types and properties of probability distributions of the finite random set
Probability distributions of finite random sets are the set functions which are defined on various systems of the random events generated by a set of X.
Definition 3. The set function f (X), X G 2X defined on the 2X of some arbitrary finite set X is the mapping f :2x ^ r with the values on the real axis r.
Definition 4. The set function f defined on the 2x is called
the additive if f (X U Y) + f (X n Y ) = f (X) + f (Y), V X,Y C X; (1)
the subadditive if f (X U Y) + f (X n Y) < f (X) + f (Y), V X,Y C X; (2)
the superadditive if f (X U Y) + f (X n Y) > f (X) + f (Y), V X,Y C X. (3)
Let KX be the finite random set defined on support |X| = N < w,where X C F.
Definition 5. The probability distribution of the finite random set Kx is called the set of 2n values of the probability measure P defined on the system of events 2x.
Any one of the six types of distributions (I-st-VI-th types) is enough for a complete determination of the probability distribution of the finite random set KX. The paper deals with the following three types.
PI. The probability distribution of the I-st type of the finite random set KX is the set function defined on the system of disjoint events of the following form {Kx = X} =
nx)n(n x°\]
xex / \xexc / /
where Xc = X \ X, xc = q \ x. It is the set of 2n probabilities
of the form p(X) = P({KX = X}). The probability distribution of the I-st type satisfies to the condition ^^ p(X) = 1.
XCX
PII. The probability distribution of the II-nd type of the finite random set KX is the set function defined on the system of joint events of the following form {KX D X} =1 P| x I . It is
\x£X )
the set of 2n probabilities of the form pX = P({KX D X}). The probability distribution of the II-nd type satisfies to the system of Frechet inequalities for the probabilities of the intersection of events [3]:
maxi P(x) -\X | + l\ < P I Q x 1 < min P(x), (4)
I xex J \xex J xeX
where P(x) = P({x G KX}), x G X, X C X, are probabilities of the events themselves.
PV. The probability distribution of the V-th type of the finite random set KX is the set function defined on the system of joint events of the following form {KX ^ Xc} =1 y x I . It is
\x£X J
the set of 2N probabilities of the form ux = P({Kx £ X= 1 - P({Kx C X. The probability distribution of the V-th type satisfies to the system of Frechet inequalities for the probability of the union of events [3]:
max P (x) < P ( |J x ) < min i 1, Z P .
xEX \xex ) I xex J
1, £ P (x)\ . (5)
xex )
For brevity, we'll use the following abbreviations: p{x} = px, p({x}) = p(x), u{x} = ux,
P{x,y} = pxy, p({x, y}) = P(xУ), u{x,y} = uxy etc.
Let's give an explanation about the symbol of the empty set, to avoid terminological confusion in the future. In the Boolean of the finite set X designation «0» is used for Xc = Q \ X, and the designation of an empty set «0» is used for treatment of an impossible event. By the definition, we accept
P|x = P| x = Q, pxc =P xc = Q, (Jx ^y x = 0. xe0 xeXc xe0 xeXc xe0 xeXc
Then the event {Kx = 0} = Plx R! P|xM = p| xc means that no one event from
\\xe9 J \xex J J xex
the X didn't occur.
For the probability distribution of the II-nd type the next equality
p® = P({kx 2 0}) = P(( H x] nt p ^ = p( p Qj = P(Q) = 1
\ \xeXc J \xex J J \xex J
is true always, and for the probability distribution of the V-th type the equality
u = P({kX ^ X}) = 1 - P({kX C X}) = P ( y ^ = P(0) = 0
is true always.
In works [3, 9] were obtained the inversion formulas for the probability distributions of the finite random set Kx which follow from the classical Mobius inversion formulas for functions. The probability distributions of the I-st and the II-nd types for the finite random set Kx are associated with help of mutually inverse Mobious formulas for all X e 2X :
px = Z p(Y ), (6)
Ye2X: xCY
P(X ) = Z (-1) ' Y ' " ' x ' Py . (7)
Ye2X: xCY
The probability distributions of the I-st and the V-th types for the finite random set Kx are associated with help of mutually inverse Mobius formulas for all X e 2X :
ux = 1 - Z p(Yc), (8)
Ye2X: xCY
P(X)= Z (-1)' x ' "' Y '(1 - UYc). (9)
Ye2X: YCx
Let us prove the properties of the set-functions (1)-(3) for the corresponding probability distributions of the finite random set.
Lemma 1. The probability distribution of the I-st type of the finite random set KX satisfies the property (1), i.e. is an additive set function on 2X.
Proof. The set of 2n events {KX = X} generated by the finite set of random events X forms the finite partition of Q. Because the {KX = X} n {KX = Y} = 0 then P({KX = X} n {KX = Y}) = 0. Therefore, the equality P({Kx = X} U {kx = Y}) + P({Kx = X} n {kx = Y}) = P ({Kx = X}) + P ({kx = Y}) is true for any X,Y C X. □
Lemma 2. The probability distribution of the Il-nd type of the finite random set kx satisfies the property (3), i.e. is a superadditive set function on 2X
Proof. For the probability distribution of the II-nd type the property (3) takes the form
PXUY + PXnY > PX + PY, X, Y C X. (10)
Let us express in (10) the probability of the II-nd type through the probability of the I-st type using the Mobius inversion formula (6). And shall prove that
£ p(Z)+ £ p(Z) > £ p(Z)+ £ p(Z), Z G 2x. (11)
XnY CZ XUY CZ XCZ YCZ
Let's note that for Z G 2x
£ p(Z)= £ p(Z)+ £ p(Z) - £ p(Z)+ £ p(Z), (12)
XnYCZ XCZ Y CZ XUY CZ XnY CZ,
X<Z; Y<Z
where the summation in the last sum applies for all such intersections X n Y C Z, that X ^ Z and Y ^ Z. Since the sum of (12) is non-negative then we obtain the equality
£ p(Z)+ £ p(Z)= £ p(Z)+ £ p(Z)+ £ p(Z), z g 2x,
XnY CZ XUY CZ XCZ Y CZ XnYCZ,
X<^Z; Y<£Z
from which the proof follows of (11). □
Lemma 3. The probability distribution of the V-th type for the finite random set kx satisfies the property (2), i.e. is a subadditive set function on 2x.
Proof. For the probability distribution of the V-th type the property (2) takes the form
uxuy + UXnY ^ ux + uy, X,Y C X. (13)
Let us express in (13) the probability of the V-th type through the probability of the I-st type using the Mobius inversion formula (8). And shall prove that
i - £ p(zc) + i - £ p(zc) <
Ze2X: XnY CZ Z£2X: XUY CZ
< i - £ p(Zc) + i - £ p(Zc).
Ze2X: XCZ Ze2X: YCZ
Here
Z p(Zc)+ Z p(zc) > Z p(zc)+ Z p(zc).
Ze2X: x nY CZ Ze2X: xuY CZ Ze2X: xCZ Ze2X: Y CZ
Because ^^ p(Zc) = ^^ p(Z), then the last inequality has the form
Ze2X: xCZ Ze2X: ZCxc
Z p(Z)+ Z p(Z) > Z p(Z)+ Z p(Z), Z e 2X. (15)
ZC(xnY )c ZC(xuY )c ZCxc Z CY c
It is not difficult to prove that set function f (Xc) = ^^ p(Z) is the superadditive if the
Ze2X: ZCxc
f(X) = p(Z) is the superadditive. Let us show that
Ze2X: ZCx
Z p(Z)+ Z p(z) > Z p(z)+ Z p(z), Z e 2X. (16)
ZCxnY ZCxuY ZCx ZCY
Let's note that for Z e 2X
Z p(Z)= Z p(z)+ Z p(z) - Z p(z)+ Z p(z), (17)
ZCx nY ZCx ZCY ZCxuY zcxnv;
Z^X; Z^Y
where the summation in the last sum applies to all such intersections Z C X n Y, that Z ^ X and Z ^ Y. Since the sum of (17) is a non-negative then we obtain
Z p(z)+ Z p(z)= Z p(z)+ Z p(z)+ Z p(z), Z e 2X,
ZCxnY ZCx UY ZCx ZCY zcxnY;
Z^X; Z^Y
from which implies the validity of (16). hence it proves (15). □
From Lemmas 1-3 implies that the choice of this or that system of events determines properties of additivity (1)-(3) of considered probability distributions of the finite random sets. Note that only the probability distribution of the I-st type satisfies the probabilistic normalization because it is defined on the complete group of disjoint events {KX = X}. Probabilistic normalization of the probability distributions of the II-nd and tthe V-th types defined on systems of a joint events {KX 2 X} and {KX ^ Xc} respectively isn't realized.
3. Sufficient conditions for the existence of probability
distributions of a finite random set of the II-nd and the V-th type
The following obvious lemma allows to consider the set function f(X) defined on system of events {KX = X} as distribution of the I-st type.
Lemma 4 (Sufficient conditions for the existence of probability distribution of the I-st type). The set function f : 2X ^ [0, 1] defined on the system of events {Kx = X} is the probability distribution of the I-st type of the finite random set Kx, if the set function satisfy to the condition
E f (x ) = 1.
xCX
The set function f (X) defined on the system of events {Kx D X} which values satisfy to the Frechet bounds (4) does not always determine the probability distribution of the Il-nd type of the finite random set. Since the transformation f (X) by formulas (7) can lead to a set function p(X) defined on the system of events {Kx = X} with values not necessarily from [0, 1].
Theorem 1 (Sufficient conditions for the existence of the probability distribution of the Il-nd type). The set function f : 2x ^ [0, 1] defined on the event system {Kx D X} is the probability distribution of the Il-nd type of the finite random set Kx, if simultaneously following conditions are satisfied:
(i) f (0) = P({Kx D 0}) = 1;
(ii) f (x) = P({x £ Kx}) are the values of the set function on events x £ X coincides with the probabilities of these events;
(iii) values of the set function satisfy to the system of the Frechet inequalities for the probabilities the intersection of events for all X £ 2x, \X\ ^ 2,
max
jo,i - £(i - f(x))| < f(x) < mxf(x);
I xex ) x
(iv) values of the set function
P(X) = £ (-1) 1 Y 1 - 1 X 1 f (Y), X £ 2x, (18)
Y £2X :XCY
derived from the f (X) by the inversion Mobius formula are non-negative.
Proof. Terms (i) - (iii) are necessary conditions for the existence probability distribution of the Il-nd type of finite random set Kx. However, they are not sufficient. In the works [3, 9] it is shown that the formula (18) transforms the set function defined on the system of events {Kx D X} in the set function defined on the system of events {Kx = X}. Let's prove that the result of this transformation is the distribution of the I-st type.
Nonnegativity of p(X) follows from the condition (iv). Let's replace in (18) the summation over all X C Y, Y £ 2x on a double sum: the inner sum over X C Yn, where Yn are subsets of fixed power \Yn\ = n, and the outer summation over the possible values of power n = \X\,..., \X\. Then (18) takes the form
x x
p(x)= E E(-1)n- 1 x 1 f(Yn)= Y.(-1)n- 1 x 1 E f(Yn).
n=| X \XCYn n=| X | X CYn
Let's prove that the sum of all values of p(X) is equal to unity:
x
E p(x) = £ IE (-Dn- 1 X1 E f (Yn)
XCx XCx \n=|X| XCYn
x
(19)
= f (0) + £ IE cm (-1)n-m E f (y )) = f (0) = 1,
n=l \m=0 YeCX J
where
cx = {Y C X : \Y\ = n} is the n-layer of set X [3, CTp. 50];
-im t -t\n-m, i It n — 0;
n (
(-Dn-m — 0, n> o.
m=0 ^
According to Lemma 4 the set function p(X) determines the probability distribution of the I-st type of the finite random set Kx. Set functions which are probability distributions of the I-st and the Il-nd types for the finite random set Kx are coupled mutually by inversion Mobius formulas (6) and (7). Thus if the set function p(X) determines the probability distribution of the I-st type, then f (X) is a probability distribution of the Il-nd type. □
The set function f (X) defined on the system of events {Kx £ Xc}, which values satisfy to the Frechet bounds (5), also does not always determine the probability distribution of the V-th type of the finite random set.
Theorem 2 (Sufficient conditions for the existence of the probability distribution of the V-th type). The set function f : 2x ^ [0, 1] defined on the event system {Kx £ Xc} is the probability distribution of the V-th type of the finite random set Kx, if simultaneously following conditions are satisfied:
(i) f(0) — P({Kx £ Xc})—0;
(ii) f (x) — P({x £ Kx}) what means that values of the set function on events at x £ X coincide with probabilities of these events;
(iii) values of the set function satisfy to the system of Frechet inequalities for probability of union of events for all X £ 2x, \X\ > 2
max f (x) < f (X) < min < f (x)
x xex
oiJ i, E f (x) f;
I xex )
(iv) values of the set function
P(X)— E (-1) ' X ' - ' Y '(1 - f (YC)), X £ 2x, (20)
Y e2X:Y CX
derived from f (X) by the inversion Mobius formula are non-negative.
Proof. Terms (i) - (iii) are necessary conditions for the existence probability distribution of the V-th type of finite random set Kx. However, they are not sufficient. In the works [3, 9] it is shown that the formula (20) transforms the set function defined on the system of events {Kx £ Xc} in the set function defined on the system of events {Kx — X}. Let's prove that the result of this transformation is the distribution of the I-st type.
Nonnegativity of p(X) follows from the condition (iv). Let's replace in (20) the summation over all Y C X, Y £ 2x on a double sum: the inner sum over Yn C X, where Yn are subsets of fixed power \Yn\ — n, and the outer summation over the possible values of power n — 0,..., \X\. Then (20) takes the form
' x ' ' x '
p(x) —£ E (-1) ' x ' -n(1 - f(Yc)) — £(-1)' x ' (1 - f(Yc)).
n=0 YnCx n=0 YnCx
Let's prove that the sum of all values of p(X) is equal to unity:
E p(x) — E |E(-1)' x ' -n E (1 - f(yc))j — 1. (21)
xex xex \n=0 YnCx )
Indeed, for a finite random set defined on the support of the unit capacity X = {x} the formula (21) takes the form
£ p(X) = p(®) + p(x) = X C{a}
= (-i)0-0 (i - f (x)) + ((-i)1-0 (i - f (x)) + (-1)1-1 (i - f m)) = = (co (-i)0-0+ci (-i)1-0) (i - f (x))+coo (-1)1-1 (i - f m) = i.
All summands are mutually removed, except the last, which gives as a result unit. For X = {x,y} the formula (21) takes the form
2
£ p(X) = p(%)+ p(x) + p(y)+ p(xy) = £ cm— (-i)m-0 (i - f (xy)) +
J2-
XCX m=0
2
+ £ c—1 (-i)m-1 £ (i - f(Yc))+C2— -if-2 (i - f(9)) =
m=1 Y
= £ I £ c2m—ni (-i)m-n £ (i - f(yc)) ] = i.
n=0 \m=n YeCX
Similarly, for any finite set X the formula (21) takes the form
£ p(x)= £ (£(-i)|X|-n £ (i - f(YO)
XCX XCX \n=0 YnCX
|X| / |X|
\X\-n '
£ c?X-nn (-i)m-n £ (i - f (y c)) I = co (-1) | X | - | X | (i - f (9)) = i,
n=0 \ m=n
where
CX = {Y Ç X : \Y| = n} - n is the layer of the set X;
|X| |X|
E\ ' cm-n (_i)m-n _ J i, \X\ — n;
Z^ ^^n,( ) _ I o, otherwise.
=0 m=n
According to Lemma 4 the set function p(X) determines the probability distribution of the I-st type of a finite random set KX. The probability distributions of the I-st and the V-th types of the finite random set KX are the set functions. These functions are linked each other by the mutually inversion Mobius formulas (8) and (9). Thus if the set function p(X) determines the probability distribution of the I-st type, then f (X) is a probability distribution of the V-th type. The theorem is proved. □
Conclusion
Sufficient conditions are formulated and proved in this paper. These conditions allow to consider the set function as the probability distribution of the finite random set of the Il-nd and the V-th type. The found conditions supplement the known necessary conditions for the existence of probability distributions of the finite random set of the Il-nd and the V-th type. These results extend the area of application of the random-set modeling of socio-economic, environmental and medical systems.
References
[1] I.Molchanov, The Theory of Random Sets, Springer, New York, 2011.
[2] H.T.Nguyen, An Introduction to Random Sets, Taylor & Francis Group, LLC, 2006.
[3] O.Yu.Vorobyev, Eventology, Siberian Federal University, Krasnoyarsk, 2007 (in Russian).
[4] A.I.Orlov, Stability in the socio-economic models, Moscow, Nauka, (1979) (in Russian).
[5] N.A.Lukyanova, D.V.Semenova, Associative Frank functions in constructing families of discrete probability distributions of random sets of events, Zh. prikladnoi diskretnoi matematiki, 32(2016), no. 2, 5-19 (in Russian).
[6] D.V.Semenova, N.A.Lukyanova, Recurrent formation of discrete probabilistic distributions of random sets of events, Zh. prikladnoi diskretnoi matematiki, 26(2014), no. 4, 47-58 (in Russian).
[7] N.A.Lukyanova, D.V.Semenova, The study of discrete probabilistic distributions of random sets of events using associative function, J. of Siberian Federal University. Math. & Phys., 7(2014), no. 4, 500-514.
[8] D.V.Semenova, N.A.Lukyanova, Formation of Probabilistic Distributions of RSE by Associative Functions, Communications in Computer and Information Science, Springer International Publishing, Switzerland, 487(2014), 377-386.
[9] O.Yu.Vorobyev, A.O.Vorobyev, Summation of the set-additive functions and Mobius inversion formula. Doklady Ross. Akad. Nauk, 336(1994), no. 4, 417-420 (in Russian).
[10] A.N.Shiryaev, Probability-1, Moscow, 2004 (in Russian).
Функции множества и распределения вероятностей конечных случайных множеств
Наталья А. Лукьянова Дарья В. Семенова Елена Е. Голденок
Сибирский федеральный университет Свободный, 79, Красноярск, 660041
Красноярский государственный медицинский университет Партизана Железняка, 1, Красноярск, 660022
Россия
В 'работе исследуются распределения вероятностей конечного случайного множества, носителем которых выступает множество случайных событий. Данные распределения вероятностей можно задать шестью эквивалентными способами (распределениями I-VI рода). Каждый из этих типов распределений вероятностей является функцией множества, заданной на соответствующей системе событий. В работе сформулированы и доказаны достаточные условия, при выполнении которых функция множества определяет распределение вероятностей конечного случайного множества II и V рода. Найденные условия дополняют известные необходимые условия существования распределения вероятностей конечного случайного множества II и V рода.
Ключевые слова: конечное случайное множество, функция множества, распределение вероятностей.