A MODIFIED INVERSE WEIBULL DISTRIBUTION USING KM TRANSFORMATION
Gauthami P., Mariyamma K. D. and Kavya P.
University of Calicut [email protected] [email protected] [email protected]
Abstract
In the subject of reliability engineering and statistics, a new reliability model is proposed, where survival analysis or life time data analysis is of major importance in the current scenario. The goal of this study is to introduce a new model that has applications to real data sets from the field of survival analysis. Deriving out the new model there are various methods to propose a new model, one of them is by using the method of transforming a variable to the variable of interest and there are numerous transformation methods which are in use right now. The newly proposed model is achieved by using the transformation method known as KM Transformation where it does not require any additional parameters to the baseline distribution which absolutely is an advantage. The model considered in this paper as baseline model is Inverse Weibull distribution with two parameters, one is a scale and other is a shape parameter. Inverse Weibull distribution is a continuous probability distribution which presently has great applications in real life phenomenon as well as so many modifications and advanced studies are introduced in this distribution from various fields. A proper study on the newly proposed model is done by deriving out its various functions and statistical properties such as Probability density function, Cumulative distribution function, Hazard rate function, Moments, Moment generating function, Characteristic function, Quantile function, Order statistics, etc. along with its Probability density function plot and Hazard rate function plot which have both upside-down and decreasing curves. Focusing on the inference procedures, the estimation of the parameters involved in this model is done by using the method of Maximum likelihood estimation. A simulation study for valuing the parameter consistency using two parameter combinations is carried out as well as a data analysis on an actual data set is also conducted. A comparison of the newly proposed model with other popular well-known models such as Inverse Weibull distribution (IW), KME distribution and KMW distribution using R programming language yielded that the new model is a better fit for the real data considered in this paper. The results and conclusions achieved throughout the paper are also mentioned at the last.
Keywords: KM Transformation, Inverse Weibull distribution, upside-down curve, decreasing
curve
1. Introduction
In the current scenario of model building and real life data analysis there are so many lifetime distributions that are in use for which [14] and [15] explains the basic ideas and concepts. Among these well-known distributions, Weibull distribution and its various modified Weibull distributions plays a great role in fitting real data sets. Inverse Weibull (IW) distribution has great applications in reliability engineering, where [3] and [4] studies the various inference procedures in the distribution of consideration. A theoretical analysis of IW distribution is done by [12] and order statistics with inference is done by [16]. [5] studies the generalized modified weibull distribution along with applications to two real data sets and [2] applied the IW distribution to
Gauthami P., Mariyamma K. D., Kavya P. RT&A, No 1 (72)
KM-IW DISTRIBUTION Volume 18, March 2023
model the wind speed data. Extended studies are done in this distribution where [13] studies the Bayesian inference and prediction of the IW distribution for type-II censored data and [10] produced a new model, generalized IW distribution.
Deriving out a new model with a baseline distribution using a transformation is common in practice but finding a model that best fits for a real data set than other related models are of great relavance. There are so many transformations in existance where the transformation named KM Transformation that studied in [11] is a recent development in the subject in which Exponential distribution and Weibull distribution are used as baseline models to fit real data sets. In this paper, we consider two-parameter Inverse Weibull distribution (IW) as baseline distribution to perform KM Transformation.
This paper is about the KM-IW Model which consists of the introduction to the model with its probability density function (pdf), cumulative distribution function (cdf) and hazard function in section 2. The basic properties of a model such as Moments, Moment generating function, Quantile function and Distribution of order Statistic are studied in Section 3. In section 4, estimation of the model parameters are done by using the method of maximum likelihood estimation. Section 5 gives the results on the simulation study of the model and section 6 gives the results on the fitting of KM-IW model to a real data set. And the conclusions achieved throughout the paper is mentioned in Section 7.
2. KM-IW Distribution
Let X be the random variable of interest. Then using the KM Transformation method the pdf, cdf and hazard function of X is obtained by the formulae given below respectively.
f (x) = 7TT e-G(x)
F(x) = [1 - e-G(x)]
-G(x
■ I I - y
e -1 and
g(x) e1-G(x)
h(x)
e1-G(x) - 1
where, g(x) and G(x) are the pdf and cdf of IW distribution which are as given below.
-(a+1)
(0 ViS,1
and
a fx\ -(a + 1) _ (x\-a
g(x) = J (JJ e-l, x > 0, a, J > 0
G(x) = e VN , x > 0, a, J > 0
Then the following are being the pdf, cdf and hazard function of our new model KM-IW distribution.
/ \ -(a+1) ( )-a ixA a
f(x) = J (jj ^f) e-e W , x > 0, a, J > 0 (1)
F(x) e
e1
1 e-
■a '
V1
, x > 0, a, J > 0 (2)
and
I (J)-("+,) e'-(« -
h(x) = ----i-, x > 0, a, J > 0 (3)
e1-e-( ?0 - 1
This newly proposed KM-IW Distribution has the pdf and hazard function plots as given below which are obtained using the R programming language.
a
e
a
2
36
Probability density function
; — a=0.4,b=0 2
---- a-1.5,b-0 7
a=0.2,b=1 4
------ 1 • t:
|j \
\ Si ' K '
- -
0 1 2 3 4 5
Figure 1: Probability Density Function of KM-IW Distribution
Hazard rate function
— a=0.4,b=0.2
---- a-1.5,b-0.7
a=0.25,b=1.5
/ \ ------ a=1.5,b=2
'■ \ '
Figure 2: Hazard Rate Function of KM-IW Distribution
As seen in the plots, both pdf and hazard rate function have upside-down and decreasing curves for different combinations of a and ft.
3. Properties of KM-IW Distribution
Here we discuss some statistical properties of the new model. They are Moments of the distribution, Moment generating function, Characteristic function, Quantile function and Order Statistic.
3.1. Moments of the Distribution
The rth raw moment about origin of the distribution is obtianed as below. F'r = E (Xr)
p TO
= xr f (x) dx
e a fx^ >+1) e-(x r e-e^*
e - 1 ß \ß
dx
—a
r
x
0
The exponential term is expanded, so we get,
"r
a Ja e r e - 1 70
xrx-(a+i)e l
-a TO
E
m=0
-e vJj
m!
dx
On proper substitution and simplification, we get,
= E (-1)" r (-a + ^ (m + 1)i Jr
"r = „ 1 E
e1
m=0
(m +1)
(4)
Putting r = 1, we get the 1st raw moment about origin (mean of the distribution) and is given by,
"1 = E (X)
i.e.,
"1
e1
E
m=0
(-1)m r (-1 + 1) (m + 1)JJ
(m +1)!
(5)
3.2. Moment Generating Function
The moment generating function (mgf) of the distribution is obtianed as below. Mx(t) = E (etX)
f TO
= J etxf(x) dx
-(a+1) (
TO tr e at x e
./0 e - 1 ^J on expanding the exponential term we get,
e Vj; e
a - x
e
dx
Mx (t)
e-1 0 aJa e ^ (-1)m rTO
E
m=0
-e vJy
dx
E
(a+1)e-(m+1)
m!
)-a TO f,\n
e -1 m=0 m! ^
x- e
E
n=0
(tx)n
n!
dx
after simplification, we get, Mx (t) =
e1
EE
to to (-1)m tn r (-1 + 1) (m + 1)n Jn (m + 1)!n!
m=0 n=0
Differentiating the mgf will yield the moments about origin of the distribution. Similarly, the characteristic function is obtianed as
<£x (t)
e1
EE
m=0 n=0
TO TO (-1)m (¿t)n r (-n + 1) (m + 1)nJn
(m +1)!n!
where, z2 = -1
(6)
(7)
3.3. Quantile Function
For obtaining the pth quantile function (Q(p)) of KM-IW distribution, we solve F(Q(p)) = p, where 0 < p < 1. And is obtained as,
Q( P) = J
- log
- log 1 -
p(e -1)
(8)
m
a
e
m
a
J
e
e
3.4. Order statistic
Let the random sample of size n, X1, X2,..., Xn be from the new distribution. Then X^), X(2),..., X(n) are the order statistics respectively. The pdf and cdf of the rth order statistic fr (x) and Fr (x) are given by
n!
fr(x) =, Fr-1(x) [1 - F(x)]n-rf (x)
and
(r - 1)!(n - r)!
Fr(x) = E n (x)[1 - F(x)]n-j
j=r ^ '
And those of our newly proposed model are,
fr(x)
n! aßa
x
-(a+1)
r- x -e ß
(r - 1)!(n - r)!(e - 1)n
\ -a\ r-1 / / ^ -a
„1-e Vß
1- e-e^ß
e
-1
(9)
and
Fr (x) = E
j=r
ej
j J (e - 1)n
1 e-
-X\ ]
e^-e ^ -1
n-]
(10)
respectively.
n
a
nr
ß
n
e
a
4. Estimation of parameters
This section is about the estimates of the parameters involved in the distribution. Here we are
using the maximum likelihood estimation method.
The likelihood function of KM-IW distribution is found by,
n
L(x, a, ft) = n f (xi, a, ft)
i=1
i.e., we will have,
L(x)
e-1 ß
nn
an
n /Y.\ -(a+11)'
e-EU( f) e-E=1 e-
So that the log-likelihood function becomes,
n
log L = n log e - n log(e - 1) + n log a + na log ft - (a + 1) £ log xi
i=1
n ( \-a n f xi\-a
- s(f) - Se-(T) (11)
The partial derivatives of log L with respect to the unknown parameters a and ft are obtianed. Equating these non-linear equations to zero gives the MLEs of a and ft.
'1 + n log ft - Dog x, + £ ( ft\-a log ( ft \ - £ f«r (Ilog ( ft \ = 0 (12)
—a
e
n
J
n - s( f)+S( J
(13)
These equations 12 and 13 cannot be solved analytically and by using statistical softwares it can be possible.
a
a
x
a
0
e
5. Simulation Study
For different combinations of a and J, samples of sizes 25, 50, 100, 500 and 1000 are generated from the KM-IW model.
We calculate the bias and mean square error (MSE)s of the estimates. Simulation is conducted for two different combinations of parameter values which are a = 2, J = 0.5 and a = 0.5, J = 1.5. As the sample size increases, the mse value decreases (see Table 1 and 2). The bias of the estimates approaches to zero as n increases. And the estimate values approaches to the true parameter values.
Table 1: Simulation study at a = 2 and J = 0.5
n Estimated value of Parameters Bias MSE
25 a = 2.0849208 0.1165587 0.1566783
j8 = 0.5670579 0.005577362 0.002953267
50 I = 1.8655174 0.0514391 0.06337958
¡5 = 0.5121194 0.00162314 0.001382758
100 a = 2.000336 0.0207338 0.02929086
5 = 0.513538 0.001644057 0.0006789977
500 a = 1.9363471 -0.01219562 0.004949019
5 = 0.5115159 0.0007829622 0.0001313705
1000 I = 2.021294 -0.01847309 0.002710569
¡5 = 0.488996 0.0008567411 6.075921 x 10-05
Table 2: Simulation study at a = 0.5 and J = 1.5
n Estimated value of Parameters Bias MSE
25 I = 0.5212292 0.02913895 0.009792336
5 = 2.4815462 0.1813135 0.6622222
50 I = 0.4663787 0.01285905 0.003961199
5 = 1.6508171 0.07065759 0.2386087
100 a = 0.5000835 0.005182735 0.00183067
¡5 = 1.6691755 0.04467576 0.1099399
500 I = 0.4840861 -0.003049672 0.0003093182
5 = 1.6430402 0.01415897 0.01952109
1000 I = 0.5053225 -0.004619055 0.0001694182
¡5 = 1.3722554 0.0124857 0.008962368
6. Real Data Analysis
Here we use KM-IW(a, J) distribution to fit a real data set and compare the results with IW distribution, KME distribution and KMW distribution. The data-set, considered here, represents survival times in Days, from a Two-Arm Clinical Trial considered by [8] and [18]. The survival time in days for the 31 patients from Arm B are:
Table 3: survival time in days for the 32 patients from Arm B
37 84 92 94 110 112 119 127 130
133 140 146 155 159 173 179 194 195
209 249 281 319 339 432 469 519 633
725 817 1557 1776
The analysis is performed by using R programming language. Table 4 gives the estimates of the model parameters, AIC (Akaike information criterion) and the BIC (Bayesian information criterion) values.
AIC = —21 + 2k (14)
BIC = —21 + k log n (15)
where, 1 denotes the log-likelihood function, k is the number of parameters and n is the sample size. And Kolmogrov-Simnorov (K-S) test is also performed and the p-value is used for comparison.
Table 4: Results of the Data Analysis
Model Estimates AIC BIC KS statistic p-value
KM-IW a = 1.1949 ß = 190.7256 417.3394 420.2074 0.0847 0.9657
IW a = 1.3375 ß = 150.4226 417.4484 420.3164 0.0897 0.9452
KME A = 0.0022 426.1304 427.5644 0.2112 0.1085
KMW a = 1.1864 ß = 464.2267 426.3770 429.2450 0.1779 0.2493
From the Table 4, we can see that our model KM-IW has better AIC, BIC values, KS statistic value and p-value than IW, KMW and KME distributions. So we can conclude that the newly proposed model is a better fit for the data taken than the other models.
7. Conclusions
A new model is introduced by modifying the Inverse Weibull distribution using the KM Transformation. Its statistical properties are studied along with the estimation of parameters. A simulation study is carried out for 2 parameter combinations and a data set is fitted by the KM-IW model which yielded a better AIC, BIC, KS-statistic values and p-value than the models compared in the study.
References
[1] Adamidis, K., and Loukas, S. (1998). A lifetime distribution with decreasing failure rate. Statistics and Probability Letters, 39(1):35-42.
[2] Akgul, F. G., Senoglu, B. and Arslan, T. (2016). An alternative distribution to Weibull for modeling the wind speed data: Inverse Weibull distribution. Energy Conversion and Management, 114:234-240.
[3] Calabria, R. and Pulcini, G. (1989). Confidence Limits for Reliability and Tolerance Limits in the Inverse Weibull Distribution. Reliability Engineering & System Safety, 24:77-85.
[4] Calabria, R. and Pulcini, G. (1994). Bayes 2-sample prediction for the inverse weibull distribution. Communication in Statistics-Theory and Methods, 23:1811-1824.
[5] Carrasco, J. M. F., Ortega, E. M. M. and Cordeiro, G. M. (2008). A generalized modified Weibull distribution for lifetime modeling. Computational Statistics and Data Analysis, 53:450462.
[6] Chakrabarty, J. B. and Chowdhury, S. (2018). Compounded inverse Weibull distributions:
Properties, inference and applications. Communications in Statistics - Simulation and Computation, 48(7):2012-2033.
[7] Drapella, A. (1993). The complementary Weibull distribution: Unknown or just forgotten?.
Quality and Reliability Engineering International, 9:383-386.
[8] Efron, B (1988). Logistic Regression, Survival Analysis, and the Kaplan-Meier Curve. Journal of the American Statistical Association, 83:414-425.
[9] Gauthami, P. and Chacko, V. M. (2021). Dus Transformation of Inverse Weibull Distribution: An Upside-Down Failure Rate Model. Reliability Theory and Applications, 16(2):58-71.
[10] Gusmao, F. R. S., Ortega, E. M. M. and Cordeiro, G. M. (2011). The generalized inverse Weibull distribution. Statistical Papers, 52:591-619.
[11] Kavya, P. and Manoharan, M. (2021). Some parsimonious models for lifetimes and applications. Journal of Statistical Computation and Simulation, 91(18):3693-3708.
[12] Khan, M. S., Pasha, G. R. and Pasha, A. H. (2008). Theoretical Analysis of Inverse Weibull Distribution. WSEAS TRANSACTIONS on MATHEMATICS, 7:30-38.
[13] Kundu, D. and Howlader, H. (2010). Bayesian inference and prediction of the inverse Weibull distribution for Type-II censored data Computational Statistics and Data Analysis, 54:1547-1558.
[14] Lawless, J. F. Statistical Models and Methods for Lifetime Data, John Wiley & Sons, New York, 1982.
[15] Lee, E. T. and Wang, J. W. Statistical Methods for Survival Data Analysis, John Wiley & Sons, Inc., Hoboken, New Jersey, 2003.
[16] Mahmoud, M. A. W., Sultan, K. S. and Amer, S. M. (2003). Order statistics from inverse weibull distribution and associated inference. Computational Statistics & Data Analysis, 42:149163.
[17] Manoharan, M. and Kavya, P. (2022). A New Reliability Model and Applications. Reliability Theory and Applications, 17(1):65-75.
[18] Mudholkar, G. S., Srivastava, D. K. and Kollia, G. D. (1996). A Generalization of the Weibull Distribution with Application to the Analysis of Survival Data. Journal of the American Statistical Association, 91.
[19] Nassar, M. M. and Eissa, F. H. (2003). On the Exponentiated Weibull Distribution. Communications in Statistics - Theory and Methods, 32(7):1317-1336.
[20] Weibull, W. (1951). A statistical distribution function of wide applicability. Journal of Applied Mechanics, 293-297.