CHI-SQUARED GOODNESS OF FIT TEST FOR GENERALIZED BIRNBAUM-SAUNDERS MODELS FOR RIGHT CENSORED DATA AND ITS RELIABILITY APPLICATIONS
M. S. Nikulin1 and X. Q. Tran1 •
Bordeaux University, 1MB, UMR 5251, F-33400 Talence, France
E-mail: mikhail. nikouline@u-bordeaux2. fr,
xuanquang. tran@math. u-bordeauxl. fr
ABSTRACT
Generalized Birnbaum-Saunders (GBS) distributions are proposed by Díaz-García et al. ([15], [16]) based on the family of elliptically contoured univariate distributions. This model is well-known as the highly flexible lifetime model by the difference in the degrees of kurtosis and asymmetry and processes uni-modality and bimodality. In this paper, a modifier Chi-squared goodness-of-fit test based on Nikulin-Rao-Robson statistics Y^ is developed for the family of GBS distributions for the right censored data with unknown parameters by using the maximum likelihood estimation (MLE). Some applications of this model in survival analysis discuss also in the section of real study.
Keywords and phrases: Birnbaum-Saunders distribution, Breast cancer, Carcinoma data, Chi-squared test, Censoring sample, GBS distributions, Goodness-of fit test, NRR test, Survival analysis.
1 Introduction
In 1969, Birnbaum and Saunders [9] have been proposed a model with two shape and scale parameters that is well known as Birnbaum-Saunders (BS) distribution. After their work, there was a lot of research work on this model and its applications in reliability and survival analysis. It must be mentioned as the work of Desmond [14] who strengthened the physical justification for the use of this distribution by relaxing some assumptions early bade Birnbaum-Saunders. Based on this distribution, Leiva et al. [23] has worked to model survival times of patients with multiple myeloma by using prognostic variables with censored data. A chi-squared test for this model is analyzed by Tahir [37] in 2012. In addition, in the recent research of Nikulin et al. ([30], [2], [29]) considered these applications of this model in the accelerated lifetimes (AFT) models and redundant systems. Nowadays, the BS distribution has known as cumulative damage distributions and it is a very useful in fatigue, reliability and survival analysis. However, its field of application has been extending beyond the original context of material fatigue and reliability analysis.
Therefore, studies to expand of the BS distribution have been looking for researchers in recent years, such as: Owen ([32], [33], [31]) proposed a three parameter Birnbaum-Saunders distribution, in 2000. Later, Volodin and Dzhungurova [38] developed a general family of fatigue life distributions denominated the crack distribution, which includes the Birnbaum-Saunders distribution as a particular case. In particular, we should be mention a generalized family of life distribution which is suggested by Díaz-García et al. [15] in their technical report in 2002, is called as the Generalized Birnbaum-Saunders (GBS) distributions. In his works, Díaz-García was obtained a distribution of the Birnbaum-Saunders type with different degrees of kurtosis, uni-modality, bimodality and absence of moments by basing on the family of elliptically contoured univariate distributions (which known as standard symmetrical distributions in R. A complete review about the
GBS distributions can be found in Sanhueza, Leiva, and Balakrishnan [36]. The purpose of this paper, we analyze a Nikulin-Rao-Robson Y^ goodness-of-fit tests for these distributions in the case of right censoring observations. We also demonstrate the applications of this model by applying it to reliability and survival data.
2 Generalized Birnbaum-Saunders distributions
As is already known, a random variable T following the BS distribution allows the stochastic representation
T = ß
where, Z « N(0,1). T
a2 — + 1
BS(a,ß), a>0,ß>0,
ien the random variable Z may be stochastically represented in the form
Z = -
a
N(0, 1).
(1)
(2)
In 2002, Díaz-García et al. [15] were developed the BS distribution becomes GBS distributions which are related to standard symmetrical distributions in R, also known as elliptically contoured or simple Elliptic distribution ([1], [8], [21], [19], [12]).
If a random variable Z follows an Elliptic distributions which correspond to all the symmetric distribution in R, denoted by Z ~ EC(iu, a2; g), the probability density function fz(z) and cumulative distribution function Fz(z) of Z given by,
ff 2, ((Z-VT Z
fz(z; v,o¿) = cgf
a'
Fz(z; ß,a2) = j fz(u; ^.,a2)du,z E R,\^.\ < œ,a > 0 .
respectively, where, g() is the kernel of the probability density function of Z, c is the positive normalization constant, such that ^ = g(u2)du. The families Elliptic distributions include three sub-models: Kotz Type (KT), Pearson type VII (PVII) and type-III generalized logistic (LIII) distributions, for more details on these distributions is given by Anderson [1] Balakrishnan [8], Fang [19], Gupta [21], Cambanis [12] and others. The Normal, Cauchy, Laplace, Logistic, Power Exponential and t(v)-Student distributions are particular cases of these symmetric sub-classes in R. In table 1 below, we recall some results for kernel function g(), the constant c corresponding with standard symmetric distribution EC(0, 1; g) in R.
Distribution Notation c g(z2),z E R
Normal N(0,1) i exp(-ir)
t(v)-Student t(v) rp±i} T(v+2)Jvñ v+l {1+Í}""'"
Laplace L(0,1) 0.5 e~\z\
Logistic Log(0,1) 1 e~z (l+e~z)2
Cauchy C(0,1) i n 1 1+z2
Power exponential PE(v) V (2v)ñr(±) exp(-^\z|2v)
LIII LIII(q) T(2s) T2(q) eqz (l+ez)2^
Distribution Notation c g(z2),z E R
Pearson VII PVH(q,r) T(q) z2Vq 1 1+7] ,q>\.v>0
Kotz type KT(q,r,s) sr(2q-l)/2s r(^) z2(q~ ~1)e~rz2s,q >±,r>0,s>0
Table 1: Kernel and normalization constants c for some indicated distributions.
Following D/az-Garc/a et al. the random variable T in (1) allows the GBS distributions, denoted by T ~ GBS(a, ft ; g),
t=p + Ja2l +1
iff the random variable Z which is given by the ex
Z =
GBS(a,ß,g),a >0,ß >0, Dression « EC(0,1; g).
So, the probability density function of T can be written as
I ,t > 0,a >0,ß >0,
the cumulative distribution function of T « GBS(a, ß ; g) is expressed by FT(t,a,ß) = Fz{i[J| - Jf|},t >0,a >0,ß > 0,
the GBS hazard rate, survival and cumulative hazard functions are
5 ,, fz{at(a,ß))At(a,ß)
AT(t,a,ß) = —- , , ,
1 - Fz(at(a,ß))
ST(t,a,ß) = 1 - Fz(at(a,ß)) ,and AT(t,a,ß) = -ln[ST(t,a,ß)],
(3)
(4)
(5)
respectively, where
at(a,ß) = \
(9
; A,(a,ß) =
&+ (9
It is clear that the properties of GBS distributions depends on the kernel function gQ) and the unknown parameter 0 = (a,0)T. The statistical theory and methodology of the GBS distributions, also some results for this flexible family of distributions mainly related to transformations, the hazard failure and censored data type II which can be found in the works of Sanhueza, Leiva et al.[36].
Table 2 below shown some probability density function of T « GBS (a, p; g), corresponding the specific symmetric distribution EC(0,1; g) in Table 1.
The Figure 1, 2, 3 and 4 below illustrates some curve of the probability densities and hazard rate functions of T « GBS(a, p ; g), allows with the kernel indicative.
Kernel g() Distribution Probability density function of T « GBS(a, ß ; g), f(t,a,ß;g),(t>0,a>0,ß >0)
N(0, 1) GBS-Normal (BS) 1 2aß\l 2n (ß )1+( ß )1 i 1 eXP 1" ^ f-L + ß - 2I _ ß ' _
Kernel
Distribution
Probability density function of
r
v +1
GBS-Student
2aß r(v + 2)V
ßj2+[ß)2
1 + -
va
t ß o
—+ —- 2 ß t
-(v+1)
L(0, 1)
GBS-Laplace
4aß
1 3
:f]5+(f)5
exp<
Log(0, 1)
GBS-Logistic
2aß
t) ^+1t
3
ßj2
11 t 1
exP \ — 1 a V t J
1 + exp <
a
L - ß
ß V t
C(0, 1)
GBS-Cauchy
2naß
1 3
:f)5+(f)5
1+— a
— + ß - 2 ß t .
Y
KT(q, r, s)
GBS-KT
2qs'aiq-3 I ßt j ^ t J
JT exP<
sr 2s a
2ßr
If t.+ß -
2s
ß t
a
2s
t ß o
—+ —- 2
ß t
j
PVII(q, r)
GBS-PVII
r(q)
2aß4rnr(q -1/2)
1 3
i J5+[f'5
PE(r, s)
GBS-PE
sr
2aK;)
ßj2+[ß )2
r- 2s ]
r t
f expi a2s T' t J
Table 2: The p.d.f of
for some indicated distributions.
Figure 1: Plots of
densities for given kernel
3
1
2
t
t
1
1
2
1
1
3
s
r
3
t
t
Figure 2: Plots of
densities for given kernel
Figure 3: Plots of
failures rates for given kernel
Figure 4: Plots of
failures rates for given kernel
3 Chi-squared type tests for right censored data
Following Bagdonavicius and Nikulin ([3], [4]), we describe a chi-squared test for testing composite parametric hypothesis when data are right censored.
Suppose that are failures time non-negative and independent and the probability
density function of the random variable belong to a parametric family .
The censoring variables are also non-negative and assumed to be random sample. Let
us and are independent. We observed
where,
Xi = Ti ACi, Si = 1{T. < c.},i = 1,2,-,n.
Defined that
f(t, 0)
S(t,0) = P0(T > t); A(t,0) = A(t,0) = -ln{S(t,0)},0 E Q QRm,
be the survival, hazard rate and cumulative hazard functions, respectively. Denote by Gt and gt are the survival and the density function of the censoring time Ct, respectively. Supposing that the right censoring is non-informative which means that the function Gt does not depend on 0. So in this case, we obtain the following expressions for the likelihood function L(0)
n
UO) = ^fs^(Xi,0)Si - Si(Xu 0)g1 - HCjGHCi). i=1
So the members with Gt and gt do not contain 0, so they can be rejected. The likelihood function is obtained
n n
UO) = ^fs^(Xi,0)S1 - HXi,0) = ^As^(Xi,0)S(Xi,0). (7)
i=1 i=1 The estimator 0n maximizing the likelihood function L(0). The log-likelihood function is
n n
1(0) = lnA(X0 0) + lnS(Xu 0)} = ^fo lnA(X0 0) - A(XU 0)}. (8)
i=1 i=1 The maximum likelihood estimator 0n satisfies the system equations
l(0n) = Om,
where 1(0) are the score vectors
jirt - 9 srm - (a((e) d{(0) a{(e)\T l(0)=Tel(0) = ) .
The Fisher information matrix is defined as
1(0) = -Ee£(0),
where
n d2 n d2 1(0) = ^Si — lnA^O) - ^—A(Xt,0).
i=1 i=1 Supposing that 0O is the true value of 0, under some regularity conditions, we have
0n ^ 0O; -JW(0n - 0O)= i~1(0o)-=i(0o) + 0P(1) ,—= l(0n) ^ i(0o),
M6n- 00)^ Nm(O,i~1(0o)); ~^i(0o)^Nm(O,i(0o)),
-n
where, 0n are the maximum likelihood estimation of 0 and the matrix
i(0o) = lim -.
n ^ CO n
For any t > 0, set
if t >Xt and 8t = 1, _(1, if t <Xt,
{t >Xi,Si=1} "
N(t)-1 -f1' if and St-1'. Y(t)-1 -f1'
W) - 1{t ^л=1] - ^ if о < t <Xt . ' Yi(t) - ^ ^ - (о,
n n
N(t) - YJNi(t) ' Y(l) -
if t >Xt.
i = 1 i=1
The process N(t) shows the number of observed failures in the interval [0, t] and the process
Y(t) shows the number of objects which are "at risk" just prior to time t. The sample (6) is
equivalent to the sample
(N1(t),Y1(t),t > 0),(N2(t),Y2(t),t > 0),-,(Nn(t),Yn(t),t > 0). (9)
For the sample (9), the parametric log-likelihood function can be written by expression follows
1(0) = f+™{\nA(u, 0) dN(u) - Y(u)A(u, 0)}du. The score function is
oo oo
(d f d 1(0) = I — \nA(u,0){dN(u) -Y(u)A(u,0)du] = I — \nA(u,0) dM(u,0), J 00 J 00
0 0
and
n ™ d2 n ™ d d T m= g¥lnA(u,0)dMi(u,0)-Yjj [—\nA(u,0)j(—\nA(u,0)j A(u,0)Yi(u)du,
t=l0 i=l0 where, Mt(t,0) = Nt(t) - fQ Yi(u)A(u,0)du,(0 E Q) is the zero mean martingale with respect to the filtration generated by the data.
Suppose that the processes Nt and Yt are observed for finite time t > 0, which means that at time t, observation on all surviving objects are censored, and so instead of using censoring time Ct. In this case, the matrix Fisher information can be written as
n ™ d d T 1(0) = -Egi(0) = fy^J {'d^\nA(u,0))(-d^\nA(u,0)) *(u,0)Yi(u)du.
t = l o
Let be consider next the hypothesis
H0 : F(x) E T0 = {Fo(x,0),0E QQRm }, here, 0 = (01, d2, ••• , dm )T are an unknown m-dimensional parameters and F0 is a known distribution function.
Subdividing the interval [0,r] into k > m smaller intervals Ij = ^aj_1,aj], with a0 = 0, ak = r, and denote by
Uj = N(aj) - N(aj_1), the number of observed failures in the jth interval Ij, (j = 1, 2, ••• , k ). Let
ej = j A(u,0n)Y(u)du.
"7—1
A chi-squared test which was proposed by Bagdonavicius and Nikulin [21], based on the vector
1
Z = (Z1,Z1,-,Z1)T, with Zj = —(Uj - ej),j = 1,2,•, k. (10)
Under the conditions
1) There exists a neighborhood Q0 of 0O such that for all n and 0 E Q0, and almost all t E [0, t], the partial derivatives of A(t, 0) of the first, second and third order with respect to 0 exist and are continuous in 0 for 0 E Q0. Moreover, they are bound in [0, r] x Q0 and the log-likelihood function may be differentiated three times with respect to 0 E Q0, by interchanging the order of integration and differentiation.
2) A(t, 0) is bound away from zero in [0, r] x Q0.
3) A positive deterministic function y(t) exists such that supt E[0,T]
KB )
4) Under condition 1) - 3), the matrix i(90) = hmn ^ is positive definite. The statistic of Bagdonavicius and Nikulin given as
Y2n(en) = zTi~z, (11)
where, is the general inverse matrix of the covariance matrix £,
= A-1 + A-^g^cA-1, g = i- cA~xcT, (12)
A is the diagonal k x k matrix with the elements Aj = on the diagonal, A~1 is inverse matrix of A, and
C = [Cij]mxk ,with £lj = ^Zi-.XieijSidXn?^^i'd),l = 1,2,-, m, j = 1, 2,■■■ , k, (13)
lV dlnA(X0 9)dlnA(X0 9)
1= [hl']mxm ,with hi' = "¿/i--Q~B--dd~,-,l,V = 1'2'"' , m' (14)
i=l 1 1
From the definition of Z in (10), the test statistic (9n) should be written as
Yn(0n)=xn+Q, (15)
where,
" (Uj - ef)2
An
= TJu6 , Q = §= i-
j=i 7
Under the hypothesis H0, the limiting distribution of the statistics Y2(0n) is chi-squared with r = rank(I~) degrees of freedom that is,
lim P{Yl(dn) > x | H0) = P{x2 > x],for any x > 0.
n ^ TO
Statistical inference for the hypothesis H0: The null hypothesis H0 is rejected with approximate significance level a if Y2(0n) > xl(r) or Y*(dn) < xl-a(r) depending on an alternative, where Xa(r) and Xi-a(r) corresponding are the upper and lower a percentage points of the ^r distribution, respectively.
Using the method of interval selection which is proposed by Bagdonavicius, and Nikulin [20], we used a.j as the random data function. Define
Ek = YH=iA(Xi,On), Ej = ^Ek, j = 1,2, — ,k. Denote by X^), X(2), — , X(n) the ordered sample from X1, X2, — , Xn. Set
bt = (n-i)A(X(i),en) +YjKxd),0n),
1=1
i = 1,2, — ,n,
if i is the smallest natural number verifying bi_1 < Ej < bt then dj verifying the equality
t-i
(n-i + 1)A(aj,0n ) + ^A(X(D,0n) = Ej
i=i
So
*J= A~1{Ej- ^-A+T Gn)'d-)] = max{X(n),r}, (j = 1,2, — ,k - 1). (16)
where A"1 is the inverse of the function A. We have: 0 < a1 < a2 < — < ak, with this choice of intervals, then ej = for all j.
Application for GBS distributions: In particular, we shall give chi-squared tests NRR for the hypothesis H0 that the data Xt are coming from the GBS distributions with the probability
density, cumulative distribution, hazard rate, survival and cumulative hazard functions give in formulas (3), (4) and (5), respectively.
The GBS log-likelihood functions 1(6), (6 = (a,p)T) is
n f 1 3n n
1(0) = -Slna- 8lnp + X5ilnj(|:)2+ (*)[ + YJSi\n{g(Ki(a,p))}
t=i
t=i
i=1
Let 6n = (â,/?) be maximum likelihood estimations which are solutions of the non-linear system equations
(ia(8)jp(8)) = 02.
Using the formula (13) - (14), the elements îu<, (I, I' = 1,2) of the Fisher information matrix
î= ihi'h x2 are
- 1 Vxi (v(~r\\ Âi(â,ê)fz(Âi(â,ê))]2
Lii
i=l
L22
L12
= 4-Y«
np2^ 1 r t=i
2 1+£ 2 ,HJ V ^ ^ 2 1- Fz(Ai(a,p))
Fz(At(â,p))
1 Bi(â,p)fz(Âi(â,p))
= 5i [-1 + (MM))- —
11 + 3Î 1
Fz(Ai(âip))
1+21+L 2
Xl
l- + -jÂi(â,p)Bi(â,p)v (fft(M)) ---f-
1 Bt(â,p)fz(At(â,p))
2 1- Fz(At(â,p))
and the matrix C = [A/]2 xk given by
^ = ¿ Z -1 + Ki{â,p)v{Ki{â,p))
Âi(â,p)fz(Âi(â,p))
"2j= 5i
C
where,
i-.Xi eij
UXi elj , 1- Fz(Ai(â,P))
11 + 3-^1 , \ 1 B-
? + -Jii(â,p)Bi(â,p)v (K^aJ)) -- -1
-1+-
1 Bi(â,p)fz(Âi(â,p))
2 1+L 2
x,
2 1 - Fz(At(a,p))
(L
, 2
№1+ ffi
iA
2
and fz(u) = c g(u2), Fz() are the probability density function and cumulative function of the random variable Z ~ EC(0,1; g) which follows a standard symmetrical distribution in R with the kernel g(), respectively, and
v(u) = -2w(u); w(u) = ^ ( ^, u > 0,
g(u)
are the transformations functions of kernel function g(u), and w'(u) is the derivative of w(u) ([15], [16]). Table 3 below shown some transformations functions w(u) and its derivative w'(u), (u > 0) corresponding with kernel g(u) of indicated Elliptic distributions EC(0,1; g).
N(0,1)_t(y}_L(0,1)_Log(0,1)_PE(v)
w(u) -1 v + 1 1 1 (Ju\ V l( , --—tanh [-— 1 -tX"1 2JH \2 ) 2
~2 2(v + u)
w'(u) 0 v + 1 1 sinhV^- ^u v[v — 1] v 7
2(v + u)2 4u^u 4u^u[1 + cosh^u] 2
Table 3: Transformations functions w(u) and its derivative w'(u), (u > 0) for kernel g(u) of the
indicated Elliptic distributions EC(0,1; g).
Chi-squared test for GBS distributions: Under the hypothesis H0, the matrix Q is non-degenerate. So, the hypothesis H0 is rejected with approximate significance level a if
Y2n(en) > xl(o).
It is necessary to note that: PE(v) = KT(1,0.5,v),N(0, 1) = KT(1,0.5,1),L(0,1) = PE(0.5),t(v) = PVll([v + 1]/2,v),Log(0,1) = Llll(1).
Thus, the next section, we consider goodness of fit test for following five distributions: GBS-N(0, 1) which known as BS distribution, GBS-Laplace, GBS-Logistic, GBS-t(v) and GBS-Cauchy distributions.
4 Real study
All distributions presented in the next two examples by using R statistics software, we analyze the goodness of fit test for the parametric generalized BS distributions in two studies to a data of breast cancer which set from research of Boag (1949) and the data from a laboratory investigation in which the vaginas of rats were painted with the carcinogen DMBA of Pike (1966).
4.1 Analysis of breast cancer data
Boag [10] was presented the survival times for 121 patients treated for cancer of the breast in one particular hospital during the years 1929-1938 which given in table below. The times are in months, and asterisks denote censoring times. This data included 66 observations and 55 censoring times.
0.3 7.4* 13.5 16.8 21.0 29.1 37* 41 45* 52 60* 78 105 *
129* 0.3* 7.5 14.4 17.2 21.1 30 38 41 46* 54 61* 80
109* 129* 4.0* 8.4 14.4 17.3 23.0 31 38* 41* 46* 55* 62*
83* 109* 139* 4.0* 8.4 14.4 17.3 23.0 31 38* 41* 46* 55* 62
83* 109* 139* 5.0 8.4 14.8 17.5 23.4* 31 38* 42 47* 56
65* 88* 111* 154* 5.6 10.3 15.5* 17.9 23.6 32 39* 43* 48
57* 65* 89 115* 6.2 11.0 15.7 19.8 24.0 35 39* 43* 49
58* 67* 90 117* 6.3 11.8 16.2 20.4 24.0 35 40 43* 51
59* 67* 93* 125* 6.6 12.2 16.3 20.9 27.9 37* 40* 44 51
60 68* 96* 126 6.8 12.3 16.5 21.0 28.2 37* 40* * 5 4 51
60* 69* 103* 127*
Firstly, we consider the hypothesis H0 that the survival times for 121 breast cancer patients belongs the Birnbaum-Saunders distribution. In this case, MLE's of the parameters 0 = (a, p)T of the BS distribution are 0n = (2.04798,47.11415)T.
Choosing the sub-intervals k = 6. The values of a.j, the frequency vector Z and the elements of the matrix C give in table follows.
/ 1 2 3 4 5 6
a.j 5.312384 9.476277 15.352179 24.889209 41.374625 154.000010
"i 2 8 9 19 15 13
ei 12.40306 12.40306 12.40306 12.40306 12.40306 12.40306
Zi -0.945732 -0.400278 -0.309368 0.599721 0.236085 0.054267
Cn 0.149316 0.015361 -0.008443 -0.042270 -0.052283 -0.058149
-0.003347 -0.000867 -0.000594 -0.001079 -0.000913 -0.001021
The matrix information of Fisher I = [îa']2 X2 and the matrix £ are
j = 2.62227 -0.0551421 -g = [ 1.203914 -0.0259961 I--0.055142 0.001280 J' 9 I--0.025996 0.000562 J.
We continued this data for GBS distributions for another kernel g: Logistic, Laplace, Cauchy and t(v) distribution with the same sub-intervals. The results give in table 4 follow.
Distribution 0n = V2 An Q Y2 1 n pv o.o5-value
GBS-Cauchy (0.95391,47.25487)T 148.2783 3.593348 151.8717 0
BGS-L(0,1) (1.36965, 51.99999)T 9.916332 0.772807 10.68914 0.0984723
GBS-Log(0,1) (0.95750, 54.63219)T 6.586821 1.861331 8.448152 0.2070737
GBS-t(100) (1.89550, 50.25370)T 20.70194 8.548100 29.25004 5.4553.10"5
GBS-t(5) (1.33215, 55.86467)T 5.652789 0.897494 6.550284 0.3644431
Table 4: MLE's of 0 = (a,fi)T, values of and ^-values with indicated kernel distributions,
data of Boag (1949).
In this example, we suggest that GBS with kernel g: Normal, t(100) and Cauchy are strongly rejected and GBS with Logistic, Laplace and t(5) kernels are very well in concordance with the survival times for breast cancer patients treated of Boag. Figure 5 below illustrates the curve of Kaplan-Meier estimate of survival function, with the curve of GBS survival functions corresponding with the kernel indicatives.
4.2 Analysis of the times until a carcinoma appeared
Pike [34] gave some data from a laboratory investigation in which the vaginas of rats were painted with the carcinogen DMBA, and the number of days T until a carcinoma appeared was recorded. The data below are for a group of 19 rats (Group 1 in Pike's paper). The two observations with asterisks are censoring times.
143, 164, 188, 188, 190, 192, 206, 209, 213, 216, 220, 227, 230, 234, 246, 265, 304, 216*, 244*.
This data analyzed by Lawless [25], pp.188 where he suggested that probability plots for two parameters Weibull distribution. By using the NRR statistics for two parameters Weibull distribution, we obtain the value of NRR statistics Y^ = 6.658669 with p-value at level significance a = 0.05 is pv005 = 0.0 8 3 6 1 0 68.
Figure 5: GBS with indicated kernel, Weibull and Kaplan-Meier estimate of for data of Boag (1949).
Figure 6: GBS with indicated kernel and Kaplan-Meier estimate of for the data of Pike (1966).
We consider next the hypotheses that the data above follows the GBS distributions in the cases of kernel : Standard Normal distribution , Standard Logistic distribution ,
Standard Cauchy distribution . Choosing grouping intervals , the results given in table
5 below.
Distribution -value
GBS- 3.125849 16.35656 19.48241 0.0006316
GBS- 1.000369 2.885825 3.886194 0.4216269
GBS- 0.749583 0.325662 1.075246 0.8981795
GBS- 0.745397 0.3637746 1.109172 0.8928143
Table 5: MLE's of , values of and ^-values with indicated kernel distributions,
data of Pike (1966).
We plot the estimated GBS survivor functions correspond indicated kernel
and the Kaplan-Meier estimate for the data of Pike (1966) in Figure 6.
In this example, it is clear that the data are the best in concord with GBS-Logistic, GBS-Cauchy and GBS- distributions, it also acceptes for two parameters Weibull distribution. However, these data contradict the BS distribution much very strongly.
5 Summary and conclusion
In this paper, we have presented a modifier Chi-squared goodness-of-fit test for generalized Birnbaum-Saunders distributions. The results obtained in our examples show that the considered families are in accordance with lifetimes data. In addition, its hazard rate functions can be unimodal or bimodal by adjusting the values of its parameters and its kernel . So, it is necessary to use it as a baseline hazard rate functions in the parametric survival model. We would like to thank our colleagues, PhD. R. Tahir and N. Saaidia for valuable comments, which helped us improve the presentation.
References
[1] Anderson, T. W. and Fang, K. T. Statistical inference in elliptically contoured and related distributions, Allerton Press New York, 1990.
[2] Bagdonavicius, V. B. and Nikulin, M. S. Statistical models to analyze failure, wear, fatigue, and degradation data with explanatory variables. Communications in Statistics—Theory and Methods, vol. 38, no. 16-17, pp. 3031-3047, 2009.
[3] Bagdonavicius, V. B. and Nikulin, M. S. Chi-squared tests for general composite hypotheses from censored samples. Comptes RendusMathematique, vol. 349, no. 3, pp. 219-223, 2011.
[4] Bagdonavicius, V. B. and Nikulin, M. S. Chi-squared goodness-of-fit test for right censored data. International Journal of Applied Mathematics and Statistics, vol. 24, no. SI-11A, pp. 30-50, 2011.
[5] Bagdonavicius, V. B., Krupois, J. and Nikulin, M. S. Non-parametric Tests for Censored Data, Wiley, 2011.
[6] Bagdonavicius, V. B., Levuliene, R. J. and Nikulin, M. S. Exact goodness-of-fit tests for shape-scale families and type II censoring. Lifetime Data Analysis, pp. 1-23, 2012.
[7] Bagdonavicius, V. B. and M. S. Nikulin, Statistical methods to analyse failures of complex systems in presence of wear, fatigue and degradation: An engineering perspective in accelerated trials. In Electronic Instrument Engineering, 2008. APEIE 2008. 9th International Conference on Actual Problems of, 2008.
[8] Balakrishnan, N. Handbook of the logistic distribution, vol. 123, CRC Press, 1992.
[9] Birnbaum, Z. W. and Saunders, S. C. A new family of life distributions. Journal of Applied Probability, pp. 319-327, 1969.
[10] Boag, J. W. Maximum likelihood estimates of the proportion of patients cured by cancer therapy. Journal of the Royal Statistical Society. Series B (Methodological), vol. 11, no. 1, pp. 15-53, 1949.
[11] BolShev, L. N. and Mirvaliev, M. Chi Square Goodness-of-Fit Test for the Poisson, Binomial and Negative Binomial Distributions. Theory of Probability \&Its Applications, vol. 23, no. 3, pp. 461-474, 1979.
[12] Cambanis, S., Huang, S. and Simons, G. On the theory of elliptically contoured distributions. Journal of Multivariate Analysis, vol. 11, no. 3, pp. 368-385, 1981.
[13] Chernoff, H. and Lehmann, E. L. The use of maximum likelihood estimates in x2 tests for goodness of fit. The Annals of Mathematical Statistics, vol. 25, no. 3, pp. 579-586, 1954.
[14] Desmond, A. Stochastic models of failure in random environments. Canadian Journal of Statistics, vol. 13, no. 3, pp. 171-183, 1985.
[15] Díaz-García, J. A. and Leiva-Sánchez, V. A new family of life distributions based on Birnbaum-Saunders distribution. Technical report I-02-17 (PE/CIMAT), Mexico. 2002.
[16] Díaz-García, J. A. and Leiva-Sánchez, V. A new family of life distributions based on the elliptically contoured distributions. Journal of Statistical Planning and Inference, vol. 128, no. 2, pp. 445-457, 2005.
[17] Drost, F. C. Asymptotics for generalized chi-square goodness-of-fit tests. CWI Tracts, vol. 48, pp. 1-104, 1988.
[18] Dzaparidze, K. O. and Nikulin, M. S. On a modification of the standard statistics of Pearson. Theory of Probability & Its Applications, vol. 19, no. 4, pp. 851-853, 1975.
[19] Fang, K. T., Kotz S. and Kai Wang, Ng. Symmetric Multivariate and Related Distributions, Monographs on Statistics and Applied Probability. 36, London: Chapman and Hall Ltd. MR1071174, 1990.
[20] Greenwood, P. E. and Nikulin, M. S. A guide to chi-squared testing. Wiley New York, 1996.
[21] Gupta, A. K. and Varga, T. Elliptically contoured models in statistics. Kluwer Academic Publishers, 1993.
[22] LeCam, L., Mahan, C., and Singh, A. An extension of a theorem of H. Chernoff and EL Lehmann. Recent advances in statistics, 303-332, 1983.
[23] Leiva, V., Barros, M., and Galea, G. A. Influence diagnostics in log-Birnbaum--Saunders regression models with censored data. Computational Statistics & Data Analysis, vol. 51, no. 12, pp. 5694-5707, 2007.
[24] Leiva, V., Riquelme, M., Balakrishnan, N. and Sanhueza, A. Lifetime analysis based on the generalized Birnbaum-Saunders distribution. Computational Statistics & Data Analysis, vol. 52, no. 4, pp. 2079-2097, 2008.
[25] Lawless, J. F. Statistical models and methods for lifetime data. New Jersey: John Wiley and Sons Publishers, 2003.
[26] Nikulin, M. S. Chi-square test for normality. In Proceedings of International Vilnius Conference on Probability Theory and Mathematical Statistics, 1973 a.
[27] Nikulin, M. S. On a chi-square test for continuous distribution. Theory of Probability and its Application, vol. 19, pp. 638-639, 1973b.
[28] Nikulin, M. S. Chi-square test for continuous distributions with location and scale parameters. Teoriya Veroyatnostei i ee Primeneniya, vol. 18, no. 3, pp. 583-591, 1973c.
[29] Nikulin, M. S., Saaidia, N. and Tahir, R. Reliability analysis of redundant systems by simulation for data with unimodal hazard rate functions. Journal\ MESA, vol. 2, no. 3, pp. 277-286, 2011.
[30] Nikulin, M. S., Saaidia, N. and Tahir, R. Recent Results in the Analysis of Redundant Systems. Recent Advances in System Reliability, pp. 181-193, 2012.
[31] Owen, W. J. A new three-parameter extension to the Birnbaum-Saunders distribution. Reliability, IEEE Transactions on, vol. 55, no. 3, pp. 475-479, 2006.
[32] Owen, W. J. and Padgett, W. J. A Birnbaum-Saunders accelerated life model. Reliability, IEEE Transactions on, vol. 49, no. 2, pp. 224-229, 2000.
[33] Owen, W. J. and Padgett, W. J. Power-law Accelerated Birnbaum-Saunders life models. International Journal of Reliability, Quality and Safety Engineering, vol. 7, no. 01, pp. 1-15, 2000.
[34] Pike, M. C. A method of analysis of a certain class of experiments in carcinogenesis. Biometrics, vol. 22, no. 1, pp. 142-161, 1966.
[35] Rao, K. C. and Robson, B. S. A chi-squabe statistic for goodies-of-fit tests within the exponential family. Communications in Statistics-Theory and Methods, vol. 3, no. 12, pp. 1139-1153, 1974.
[36] Sanhueza, A., Leiva, V. and Balakrishnan, N. The generalized Birnbaum-Saunders distribution and its theory, methodology, and application. Communications in Statistics— Theory and Methods, vol. 37, no. 5, pp. 645-670, 2008.
[37] Tahir, R. On Validation Of Parametric Models Applied In Survival Analysis And Reliabilty. Thesis of University Bordeaux I, 2012.
[38] Volodin, I. N. and Dzhungurova, O. A. On limit distributions emerging in the generalized Birnbaum-Saunders model. Journal of Mathematical Sciences, vol. 99, no. 3, pp. 1348-1366, 2000.