УДК 519.24
A Class of Special Empirical Processes of Independence
Abdurahim A. Abdushukurov* Leyla R. Kakadjanova^
Dpt. Probability Theory and Mathematical Statistics National University of Uzbekistan VUZ Gorodok, Tashkent, 100174
Uzbekistan
Received 12.12.2014, received in revised form 13.02.2015, accepted 12.03.2015 In this paper we investigate the asymptotic properties of one class of empirical processes for certain classes of integrable functions.
Keywords: empirical processes, metric entropy, Glivenko-Cantelli theorem, Donsker's theorem.
Introduction
In this paper we investigate the limit properties of a class of empirical processes of independence indexed on a set of measurable functions. The necessity of considering such processes stems from practical situations where we are interested in joint properties of pairs consisting of random variables (r.v.-s) and events.
Let us consider the following sequence of experiments in which observed pairs are consisted of {(Xk, Ak), k > 1}, where Xk are random elements defined on a probability space (Q, A, P) with values in a measurable space (X, B). Events Ak have a common probability p G (0,1). Let Sk = I (Ak) be the indicator of the event Ak. At the n — th step of experiment is observed the sample S(n) = {(Xfc, 4), 1 < k < n}. Each pair in the sample S(n) induces a statistical model with the sample space X® {0,1}, sigma-algebra of sets of the form B x D and induces distribution Q* (B x D) = P (Xfc G B,4 G D), where B G B, D c {0,1}. Let us define submeasures Qi (B) = = Q* (B x {1}), Qo (B) = Q* (B x {0}) and Q (B) = Q* (B x {0,1}) = Qo (B) + Q1 (B), B G B. We also consider the hypothesis H of independence Xk and Ak for each k > 1. The validity of H can be tested by using the equations Q1 (B) = pQ (B) or Q0 (B) = (1 — p) Q (B) for any B G B. We define the measures A (B) = Qi (B) — pQ (B), B G B. Thus, under the hypothesis H : A(B) = 0, for any B G B. Let us define the empirical measures for all B G B:
1 n
Qin (B) = -V skI (Xk g B),
ri < *
n
fc=l
1 П
Qon (B) = -V(1 - 4K(Xfc G B),
n
k=i
i n
Qn (B) = - У) I (Xfc G B) = Qon (B) + Qin (B)
Tí --
n
fc=i
*^[email protected] [email protected] © Siberian Federal University. All rights reserved
These measures are empirical estimates for Qi, Q0 and Q respectively. Since p = Q1 (X) then
1 "
estimate for p is pn = Q1n (X) = — ^fc. According to the strong law of large numbers (SLLN)
n k=i
for a fixed B when n ^ to, Qjn (B) ^ Qj (B), j = 0,1 and consequently, Qn (B) ^ Q (B) and p«8^' p. Thus, for each B e B at n ^ to, An (B) = Q1n (B) - pnQn (B) ^ A (B) and under validity of H, An (B) 0. Thus we are naturally led to the study of limit properties of processes of independence {An (B) — A (B)} for a certain class G sets of B. In this paper we consider general classes of specially normalized empirical processes of independence indexed by a class of measurable functions.
1. Empirical processes of independence
Suppose that F be a set of measurable functions f : X ^ R. For the signed measure G and function f e F we define the integral
Gf = f fdG. ■Jx
Let us define F is indexed empirical process Gn : F e R as:
n
f ^ Gnf = vn(Qn — Q) f = (f (Xk) — Qf), f e F.
fc=1
Note that G«f = Go«f + Gi«f, where {Gj«f = ^n (Qjn — Qj) f, j = 0,1, f e F} is subempirical processes. According to the SLLN and the central limit theorem (CLT) and under conditions Q |f | < to, Qf2 < to for the given function f we have
QnA8 Qf, Gnf ^ N (o, Q(f — Qf)2) . (1)
Uniformly variants for f e F in statements (1) have well-developed theory. The generalized analogues of classical Glivenko-Cantelli theorem and Donsker's theorem for F-indexed empirical processes can be found in [1-7]. One should mention the special case when F is the set of indicators of a class G of sets B:
F = {I (B): B e G} . (2)
It is easy to see that in this case {Gnf = Gn (B) = A/n(Qn(B) — Q(B)), B e G} and this process is called as G-indexed. An example of such process is the classical empirical process obtained by X = Rm, G = {(—to,x] : x e Rm} , Q ((—to,x]) = H (x) and Qn ((—to,x]) = Hn (x) as {Gn ((—to, x]) = Vn (Hn (x) — H (x)), x e Rm} .
Let us return to general F-indexed processes {Gnf, f e F} and recall that there are various variants of the Glivenko-Cantelli theorem based on the theory of metric entropy under certain conditions on the set of measurable functions F. These conditions ensure that ||G«||f = sup{|Gnf | : f e F} converges in probability to zero or it almost surely converges to zero. Such classes F are called the weak or strong Glivenko-Cantelli classes, respectively. Donsker-type theorems provide general conditions on F under which
Gnf ^ Gf in 1TO(F), (3)
where 1TO(F) is the space of all bounded functions f : X ^ R equipped with the supremum-norm |f and ^ means the weak convergence (see [6], p. 81).
Class F for which convergence (3) holds is called a Donsker class. Limiting field {Gf, f G F} called Q-Brownian bridge. It is a tight Borel measurable element of 1TO(F) and it is a Gaussian field with zero mean and covariance function
EGfiGf = Q (fi - Qfi) (f2 - Qf2) = Qfif2 - QfiQf2. (4)
Q-Brownian bridge {Gf, f GF} can be represented in terms of Q-Brownian sheet {W (f), f GF}
as
Gf = W (f) - W (1) Qf, f gF, (5)
where EW (f) = 0, EW (fi) EW (f2) = Qfif2 and W (1) is the value of Q-Brownian sheet for f = 1.
In connection with the problem of testing the hypothesis H, we introduce F-processes
Af = Qif - pQf, Af = Qinf - pnQnf, f GF. (6)
Let us note that for the given function f, when n ^ to, Qj |f | < to, j = 0,1, we have Anf ^ Af in accordance with SLLN and under validity of H, Af = 0. It is easy to see that for the fixed f, variable y/n (An - A) f is a linear functional of subempirical processes provided that Qjf2 < to, j = 0,1, and it has the limit normal distribution with zero mean. In this paper we propose and study the following F-indexed normalized process in order to test the hypothesis H:
Af = f fdA„ = n-/ (An - A) f, f G F. (7)
JX VP" (1 - Pu)J
Process (7) has the important property: it converges to the same Q-Brownian bridge {Gf , f G F} under validity of H. Certain of the results presented in this paper can be found in reports [8-11].
2. Asymptotical results
Let Lq (Q) be the space of functions f : X ^ R with the norm
IQ,q = (Qlf ) /q = { j If |qd ' ^
To prove the F-uniform variants of Glivenko-Cantelli theorem and Donsker's theorem we define the complexity or entropy of class F. To determine the entropy it is necessary to define the concept of e-brackets. The e-bracket in Lq(Q) is a pair of functions y, ^ G Lq(Q) such that Q (y(X) < ^(X)) = 1 and - y|Q q < e, i.e. Q(^ -y)q < eq. Function f G F is in (or covered by) bracket [y, -0], if Q (y(X) < f (X) < ^(X)) = 1. One should note that the functions y and ^ may not belong to the class F, but they must have finite norms. Bracketing (or covering) number N[] (e, F, Lq (Q)) is the minimum number of e-brackets in Lq(Q) needed to cover F (see [1-7]):
. jk : forsome fi,...,fk GLq (Q) ,
N[] (e,F, Lq (Q)) = mm {F c U [fi, fj] : ||fj - fi|Qiq < e.
Number Hq (e ) = logN[] (e, F, Lq (Q)) is called the metric entropy with bracketing of the class F in Lq(Q). Number Hjq (e) = logN[] (e, F, Lq (Qj)), j = 0,1 denotes the metric entropy
of a class F in Lq (Qj), j = 0,1, respectively. To prove the weak convergence of F-indexed
empirical processes (7) we introduce the integral of the metric entropy with bracketing as
, '-q^jj) =
rs
j (S) = Jj[] № F ; £ (Qj )) = J (Hjq (e))i/2de, j =0,1, for 0 < S < 1.
Recall that numbers N[\ (•) converge to +to at e j 0. However, it is necessary for Donsker's theorem that they converge not very fast to +to. This speed is measured by the integrals jj^ (J) (see [6,7]).
The following theorem shows validity of Glivenko-Cantelli type theorem for the process {Anf, f e F}. Here sign * means a.s. convergence by outer probability.
Theorem 2.1. Let the class F such that
N[\ (e, F, Li (Qj)) < to, j = 0,1. (8)
Then under validity of the hypothesis H and at n ^ to
n-i/2Anf
*
* a.s.
F (9)
Proof. According to SLLN when n ^ to, pna's'p G (0,1). Therefore, convergence of (9) is
yan/iif ^ °, n ^ to. (10)
equivalent to
IA f
If hypothesis H is valid, then it is easy to verify that
l|Anf If < ll(Qin — Qi) f If + pnll(Qn — Q) f IIf + IIQf If • |pn — pi <
< 2|(Qin — Qi) f If + ll(Qon — Qo) f If + If IIq,i • Ip« — pI , (11)
where
iq,i I |f |dQ < f |f I dQi + / |f | dQo =||f |Qlii + If |Qo,i < to. (12)
'X JX JX
Under conditions (8) F is a Glivenko-Cantelli class with respect to measures Qj, j = 0,1. Hence, by Theorem 19.4 in [7] for each e > 0:
limsup ( sup |(Qjn - Qj) f |) < e. (13)
n^œ \ff eF J
Now relations (10) and (9) follow from (11)-(13). Theorem is proved. □
To prove the weak convergence of process (7) to a Gaussian process, we first investigate the limiting properties of two-dimensional empirical field {(Anf, Aing), f, g G F}, where Anf = n1/2 (Qn - Q) f and Aing = n1/2 (Qin - Qi) g. Theorem 2.2. Let the class F such that
F C £2 (Qj) and J2 (1) < to, j = 0,1. (14)
Then for n ^ to sequence {(Anf, Aing) , f, g G F} of F ^ R2 maps weak converge in lœ (F) x lœ (F) to the two-dimensional Gaussian field {(Af,Aig) , f, g G F} with zero mean and the following covariance structure for f, g G F:
E (Af • Ag) = Qfg -« E (Aif • Aig)= Qifg - QifQig, (15)
E (Af • Aig) = Qi fg - Qf Qi g.
Proof. From the first condition in (14) it follows that for the fixed /, g G F : Q/j2 = Qo/j + Qi/2 < to and Qig2 < to, i = l,m. Then according to multidimensional CLT finite dimensional distributions of vector (An/,Aing) converge to multivariate Gaussian distribution with zero mean vector. Covariance matrix defined by structure (15) is the normalized sum of independent and identically distributed r.v.-s :
n
(A„/,Ai„g) = (/ (Xfc) - Q/, 4g (Xfc) - Qig).
k=1
It remains to prove tightness of (An/, A1ng). Under conditions (14) and n ^ to we have following Donsker's theorems (see [6]):
An/ ^ A/ in Z~ (F), Am/ ^ Ai/ in (F), (16)
where limiting processes are respectively Q- and Qi-Brownian bridges, i.e. tight Borel measurable elements of (F). Then the sequences of marginal distributions which induced by processes {An/, / G F} and {Ain/, / G F} are tight (see, Lemma 1.3.8 in [6]). Process {(An/,Aing) , /, g G F} is element of space (F) x (F) and by Lemma 1.4.3. in [6] also induces in this space the tight sequence of distributions. Theorem is proved. □
Remark. In formula (15) at g = l we have Qi l = p and
E (A/ • Ai1) = Qi/ - p Q/, / GF. (17)
Hence, when hypothesis H is valid then covariance (17) is equal to zero for all / G F. Thus under hypothesis H the Brownian bridge {A /, / G F} and r.v. = Ai1 with normal distribution N (0,p (1 — p)) are independent.
Let us introduce the empirical process |n1/2 (An — A) / = Gn/, / G f| . This process connected with process (7) by the following relation:
Gn/ =(pn (1 - pn ))1/2 • An/,/ GF. (18)
Process (18) plays a supporting role in study of basic process (7) which property of weak convergence to a Q-Brownian bridge is contained in the following statement.
Theorem 2.3. Under the conditions of Theorem 2.2 for n ^ to
An/ ^ A/ in 1TO(F), (19)
where {A/, / G F} is a Gaussian field with zero mean and under validity of the hypothesis H it coincides with Q -Brownian bridge.
Proof. We consider process (18) and represent it in the form Gn/ = Ain/ - pnAn/ - ^nQ/, where An/ = Aon/ + Ain/, Ajn/ = n1/2 (Qjn - Qj) /, j = 0,1; Mn = n1/2 (pn - p) = Ainl. It is easy to see that Gn/ is asymptotically equivalent (in terms of convergence to the same process) to the process Gn/ = Ain/ - pAn/ - ^nQ/. According to Theorem 2.2 for n ^ to
Gn/ ^ G0/ = Ai/ - pA/ - MoQ/ in (F). (20)
Let us note that process {G°/, / G F} is a linear functional of Gaussian processes. It is also a Gaussian process with zero mean and covariance which calculated with the use of (15) and (17) for /,g G F as
9
EG0/G°g = Y^ C, (21)
j=i
where
Ci = Qifg — QifQig; C2 = —p (Qifg — QfQig); C3 = — (1 — C4 = —p (Qfg — QgQif); C5 = p2 (Qfg — Qf Qg); Ce = —pQf C7 = — (1 — p) QgQif; Cg = —pQg (Qif — pQf); C9 = p (1 —
Under validity of the hypothesis H and taking into account the remark to Theorem 2.2 it is easy to verify that from (21) we have EG°f <G°g = p (1 — p) (Qfg — QfQg). Then [p (1 — p)]-1/2<G°f=<Gf. Thus we obtain a Q-Brownian bridge with covariance (4). Therefore, according to (18) for n — to
Anf ^ [p (1 — p)]-1/2G°f in Z~ (F) and when hypothesis H is valid then
Anf ^ Gf in Z~ (F).
□
Let us consider a generalization of Theorem 2.3 to the case of random sample size. Suppose that at n-th stage of observations a random number of observations from an infinite sequence of independent and identically distributed pairs (Xi, Ji), (X2, J2),... is available Here Nn is integer-valued nonnegative r.v. defined on the same probability space (Q, A, P). Let the sequence Nn converges to infinity in the strong sense that there is a r.v. v and at n — to
Nn —- v, (22)
Here P(v > 0) = 1 and Cn — to is a deterministic sequence of numbers. Let {ANn f, f e F} be a sequence of normalized empirical processes of independence obtained from (7) by replacing index n to a random sequence Nn. The following theorem shows that this process has the same limiting distribution as {Anf, f e F}.
Theorem 2.4. Under the conditions of Theorem 2.3 and (22) at n — to
An„ f ^ Af in Z~(F). (23)
Consequently, from Theorem 2.3 and (23) under validity of hypothesis H, distribution of Af coincides with the distribution of Q-Brownian bridge with covariance (4).
Proof is the consequence of Theorem 3.5.1 from [6] and Theorem 2.3 and hence details are omitted. □
Now suppose that {Nn, n > 1} a sequence of Poisson r.v.-s with the mean n and independent identically distributed r.v.-s (Xi, Ji), (X2, J2),... . Let us denote by {Anf, f e F} a normalized empirical process of independence obtained from (7) by replacing the upper bounds n in all summations to Nn . Next theorem shows that the limiting process is the Q— Brownian sheet as defined in (5).
Theorem 2.5. Under the conditions of Theorem 2.3 at n — to
Anf ^ Af in Z~(F), (24)
where by hypothesis H, A*f = W(f), f e F.
Proof follows from Theorems 3.5.1, 3.5.3 from [6] and Theorem 3.4 if we take into con-Nn p ; Nn
sideration that — —► 1, and processes AN / = ni/2(J] /(Xk) - nQ/) and A*N / =
n n k=i n
2 Nn
ni/2( J2 4/(Xk) - nQi/) have following standardized representations: k = i
kNn / = V —an
AN„ f = V^Ä^f + — - 1)Qf, n V n n
aîn„f = \/—Ä1N„f + vn(— - 1)Qif.
n V n n
The details are omitted. □
The results of Theorems 2.3-2.5 can be used to construct the statistics for testing the hypothesis H. For example, from processes (Anf, f G F}, (ANnf, f G F} and (Af, f G F} one can construct the following Kolmogorov-type statistics Kn = ||Anf, KNn = ||ANnf||f and ||Anf ||f which under validity of H have limiting distributions of r.v.-s K0 = ||Gf ||f and Kn = ||W(f , respectively.
3. Application to random censoring
Let us consider a right random censoring model, where Xj = min(Tj, Q} and Aj = (Tj < Cj}. Here r.v.-s Tj and Cj denote life times and censoring times. They are mutually independent with common continuous distribution functions F and G respectively (F(0) = G(0) = 0). Then considering data S(n) = ((Xj, Jj), 1 < i < n} with = I (Aj), r.v.-s of interest Tj are observed when Aj occurs, i.e., = 1. Take into account that Xj have common distribution function H =1 — (1 — F)(1 — G) and subdistributions defined as
Qo (B) = P (Xk G B,4 =0)= P (Ck G B n [0,Tk)) / (1 - F(t))G(dt),
J B
Qi (B) = P (Xk G B,5k = 1) = P (Tk G B n [0,Ck]) / (1 - G(t))F(dt).
B
(25)
Now we consider simple proportional hazards model (PHM) or Koziol-Green model which is very useful in practical applications (see, for example, [12-16]). In PHM we assume the parametric relation
1 - G = (1 - Ff for some p > 0. (26)
Taking into consideration (26), it is easy to see that 1 - F = (1 - H)p, where p = 1 + =
P(Ak). One of basic properties of PHM is that (26) holds when r.v.-s Xk and are independent. Such characteristic of PHM plays a basic role in constructing and studying estimators of many functionals of distribution F. The following sufficient maximum likelihood estimator of F was first introduced and studied [12-14]:
Fn(t) = 1 - (1 - Hn(t))pn, (27)
1 n 1 n
where Hn(t) = — 1(Xk < t) andpn = — ^fc are independent empirical estimators of H(t) nk=i nk=i and p, respectively.
There are many papers devoted to statistical analysis of Fn. These papers are concerned with the superiority of methods for estimation and the testing in PHM and methods are based on Fn rather than on the product-limit estimator of Kaplan-Meier. Some references can be found in [16]. Hence the question arises as to when the advantages of the PHM can be used. In other words, there is now a need for testing of validity of PHM, i.e., for the composite hypothesis described by relation (26). But this relation is equivalent to hypothesis H on independence of r.v.-s (Xi, ...,Xn) and (¿i,..., ¿n).
Let us consider the following special empirical process (7):
( n \i/2
An(t) = ( p (1 — p ) ) (Hin(t) — pnHn(t)), —to < t < to, (28)
1n
where Hin(t) = — 1(Xk < t, ¿k = 1). Then we have the consequence of Theorem 2.3: if H
n k=i
holds then as n — to
a« (•) ^ B (H (•)), (29)
where {B(y), 0 < y < 1} is a Brownian bridge. Several statistics for testing H were considered [13-15]. Note that these statistics are based on relation (29) and corresponding tests are consistent. Moreover, by Theorems 2.3-2.5 one can consider more general classes of statistics using F-indexed processes that are more flexible in applications than (28).
References
[1] K.S.Alexander, Probability inequalities for empirical processes and a law of the iterated logarithm, Ann. Probab., 12(1984), no. 4, 1041-1067.
[2] R.M.Dudley, Central limit theorems for empirical measures, Ann. Probab., 6(1978), 899-929.
[3] P.Gaensler, W.Stute, Empirical processes: a survey of results for independent and identically distributed random variables, Ann. Probab., 7(1979), no. 2, 193-243.
[4] E.Gine, J.Zinn, Some limit theorems for empirical processes, Ann. Probab. 12(1984), no. 4, 929-989.
[5] G.R.Shorack, J.A.Wellner, Empirical processes with applications to statistics, John Wiley & Sons, 1986.
[6] A.W.Van der Vaart, J.A.Wellner, Weak convergence and empirical processes, Springer, 1996.
[7] A.W.Van der Vaart, Asymptotic Statistics, Cambridge University Press, 1998.
[8] A.A.Abdushukurov, L.R.Kakadjanova, About one class of empirical processes, Proceedings of the XVI International Conference on eventological mathematics and related issues, Krasnoyarsk, 2012, 31 (in Russian).
[9] A.A.Abdushukurov, L.R.Kakadjanova, Asymptotical Gaussian distribution of a class of empirical processes, Proc. of reports. International Conf. "Problems of modern topology and its applications", Tashkent, 2013, 100-102 (in Russian).
[10] A.A.Abdushukurov, L.R.Kakadjanova, An empirical processes of independence by class of measured functions, Materials of conf. "Statistics and its Applications", Tashkent, 2013, 24-29 (in Russian).
[11] A.A.Abdushukurov, L.R.Kakadjanova, An empirical processes of independence indexed by class of measured functions, Acta National University of Uzbekistan, 1(2014), no. 2, 15-20 (in Russian).
[12] A.A.Abdushukurov, On some estimates of the distribution function under random censorship, Conference of young Scientists, Math. Inst. Acad. Sci. Uzbek SSR, VINITI, 8756(1984) (in Russian).
[13] A.A.Abdushukurov, Nonparametric estimation in proportional hazards model, VINITI, 3448-87(1987) (in Russian).
[14] A.A.Abdushukurov, Estimating of the probability density and intensity function of the Koziol-Green model of random censoring, Sankhya: The Indian J. Statist., 48(1987), 150-168.
[15] S.Csorgo, Estimation in the proportional hazards model of random censorship, Statistics, 19(1988), no. 3, 437-463.
[16] S.Csörgo, J.J.Faraway, The paradoxical nature of the proportional hazards model of random censorship, Statistics, 31(1998), 67-78.
Класс эмпирических процессов независимости
Абдурахим А. Абдушукуров Лейла Р. Какаджанова
В данной статье мы исследуем асимптотические свойства одного класса эмпирических процессов
для определенных классов интегрируемых функций.
Ключевые слова: эмпирические процессы, метрическая энтропия, теоремы Гливенко-Кантелли и
Донскера.