Научная статья на тему 'ON THE INFERENCES AND APPLICATIONS OF WEIBULL HALF LAPLACE{EXPONENTIAL} DISTRIBUTION'

ON THE INFERENCES AND APPLICATIONS OF WEIBULL HALF LAPLACE{EXPONENTIAL} DISTRIBUTION Текст научной статьи по специальности «Науки о Земле и смежные экологические науки»

CC BY
94
39
i Надоели баннеры? Вы всегда можете отключить рекламу.
Ключевые слова
Laplace distribution / Half-Laplace distribution / Weibull Half-Laplace distribution / Censored data / Lifetime data / Maximum likelihood estimation

Аннотация научной статьи по наукам о Земле и смежным экологическим наукам, автор научной работы — Adeyinka S. Ogunsanya, Obalowu Job

The study of probability distribution has expanded the field of statistical modelling of real life data. It has also provided solution to the problems of skewed data which often violate the normality. This research work introduces a new T Half-Lapalace{Exponential} family with a novel Half-Laplace distribution as baseline distribution with specific interest in three-parameter lifetime model called the Weibull-Half-Lapalace{Exponential} (W-HLa{E}) distribution. The W-HLa{E} model is capable of modeling various shapes of aging events. The W-HLa{E} distribution is derived by combining Half-Laplace and Weibull distribution using the quartile function of Exponential distribution. Some of its statistical properties such as the mean, mode, quantile function, median, variance, standard deviation, skewness, and kurtosis are derived. Other statistical properties such as survival function, hazard rate, moments, asymptotic limit, order statistics, and entropy which is the measure of uncertainty of a random variable are derived and studied. The parameter estimation method adopted in this study is the maximum likelihood method. The graphs of W-HLa{E} at different values of shape and scale parameters show that the distribution is unimodal hence the mode is given as 𝑚𝑜𝑑𝑒 = 𝜃 + ((k-1)/k)¹/ᵏ and it is positively skewed with a steep peak. A simulation study is carried on the new proposed distribution using maximum likelihood estimation. The simulation also supported the theoretical expression of the statistical properties of the proposed distribution such as the location parameter does not affect the variance, skewness, and kurtosis of the new distribution. The importance and the flexibility of the proposed distribution in modeling some real life data sets is demonstrated in the research. The results of the study shows that the proposed W-HLa{E} distribution perform better than other disributions in the literature.

i Надоели баннеры? Вы всегда можете отключить рекламу.
iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.
i Надоели баннеры? Вы всегда можете отключить рекламу.

Текст научной работы на тему «ON THE INFERENCES AND APPLICATIONS OF WEIBULL HALF LAPLACE{EXPONENTIAL} DISTRIBUTION»

ON THE INFERENCES AND APPLICATIONS OF WEIBULL HALF LAPLACE{EXPONENTIAL} DISTRIBUTION

Adeyinka S. Ogunsanya* department of Statistics, University of florin, Kwara State, Nigeria Correspondence email address:ogunsanyaadeyinka@yahoo.com,

Obalowu Job

2Department of Statistics, University of florin, Kwara State, Nigeria oobalowu@unilorin.edu.ng

Abstract

The study of probability distribution has expanded the field of statistical modelling of real life data. It has also provided solution to the problems of skewed data which often violate the normality. This research work introduces a new T - Half-Lapalace{Exponential} family with a novel Half-Laplace distribution as baseline distribution with specific interest in three-parameter lifetime model called the Weibull-Half-Lapalace{Exponential} (W-HLa{E}) distribution. The W-HLa{E} model is capable of modeling various shapes of aging events. The W-HLa{E} distribution is derived by combining Half-Laplace and Weibull distribution using the quartile function of Exponential distribution. Some of its statistical properties such as the mean, mode, quantile function, median, variance, standard deviation, skewness, and kurtosis are derived. Other statistical properties such as survival function, hazard rate, moments, asymptotic limit, order statistics, and entropy which is the measure of uncertainty of a random variable are derived and studied. The parameter estimation method adopted in this study is the maximum likelihood method. The graphs of W-HLa{E} at different values of shape and scale parameters show that the distribution is unimodal hence the mode is given k-l !

as mode = 8 + and it is positively skewed with a steep peak. A simulation study is carried on the new proposed distribution using maximum likelihood estimation. The simulation also supported the theoretical expression of the statistical properties of the proposed distribution such as the location parameter does not affect the variance, skewness, and kurtosis of the new distribution. The importance and the flexibility of the proposed distribution in modeling some real life data sets is demostrated inn the research. The results of the sudy shows that the proposed W-HLa{E} distribution perform better than other disribbutions in the literature.

Keywords: Laplace distribution; Half-Laplace distribution; Weibull Half-Laplace distribution; Censored data; Lifetime data; Maximum likelihood estimation

1. INTRODUCTION

Numerous classical distributions have been extensively used over the past decades for modeling data in several areas such as engineering, actuarial, environmental and medical sciences, biological studies, demography, economics, finance, and insurance. However, there is a clear need for extended forms of these classical distributions. Due to that reason, researchers have developed and studied several methods for generating new families of distributions. The most outstanding characteristics of this distribution are that it is unimodal and symmetric. Laplace distribution is a

A. S. Ogunsanya and O. Job RT&A, No 1 (72)

WEIBULL HALF LAPLACE {EXPONENTIAL} DISTRIBUTION_Volume 18, March 2023

mixture of normal laws (see Kotz [10]), as a possible explanation of the wide applicability of this distribution for modeling growth rates. Let f(x, 8, ft) be the probability density function of Laplace distribution.

f(x,8,/3) = e(-lx-dl/#\x e (-«,«>) (1)

and CDF, F(x) is given as:

(i e(-!$#)' ifX<e

F(x-,e,p) = \ " 1 (2)

(l- 1 e+ $ ) if x>8

The expected value of a Laplace distribution is given as E(x) = 8. The expected value of a Laplace distribution is the same as the location parameter and a symmetric situation means that the mean is the same as mode and median. Laplace distribution can be compared with other symmetric distributions like Normal, Logistic, etc. except that Laplace has a higher spike and slightly thicker tails.

In recent years, Asymmetric Laplace distribution of [10] has received much attention in modeling currency exchange rates, interests, stock price changes which is a modification of Laplace distribution but not for survival data. Many researchers have developed compound distributions using different methods to fit survival data. In this research work, we shall reduce the classical Laplace distribution to a non-negative function. Most real-life quantities to be measured are nonnegative values. With the assumption that the data is non-negative, therefore, it is necessary to reduce the Laplace distribution to one-sided (positively skewed distribution). Thus, the one-sided Laplace distribution, otherwise called the half-Laplace distribution is the positive side of the Laplace distribution.

Let X be a random variable on R+ = ( 0 , given by the density function of Laplace distribution equation (1), where x > 8 > 0 and ft > 0, then x is said to have a half Laplace distribution, denoted by HL(6,P)

f(x,8,p)= ! e $ , x > 8 > 0; ft > 0 (3)

and the CDF of the half-Laplace distribution is

1 i%—)

F(x,8,P) = 1-1 e $ ),x>8>0;P>0 (4)

Where 8 is the location parameter and ft is the shape parameter The half-Laplace distribution reduces to the exponential distribution when 8 = 0, we have the exponential distribution. Recently, many researchers have developed and studied compound distributions using T-X which was introduced by Alzaatreh [5] and T-X{Y} by Aljarrah [3]. This was later modified by Alzaatreh [6] as T-gamma family, Alzaatreh [7] also constructed T-normal families. Almheidat [4] studied the T-Weibull family. Amalare [8] derived Lomax-Cauchy {Uniform}. Ogunsanya [13] developed and studied the extension of Cauchy distribution named Rayleigh Cauchy distribution, Ogunsanya [14] studied Weibull-Inverse Rayleigh distribution: Classical/ Bayesian approach and Job [11] applied Weibull Loglogistic{Exponential} distribution on some survival data. In Section 2, we derive the new Weibull Half Laplace distribution, with statistical properties such as hazard function, survival function, Moments, skewness, kurtosis, order statistics, and Shannon entropy are determined. Section 3 shows the simulation study. Estimation of the parameters of W-HLa{E} distribution by maximum likelihood is performed in Section 4. In Section 5, the performance of the new W-HLa{E} distribution is demonstrated on two real data sets and the conclusion and summary of the work are expressed in Section 6.

A. S. Ogunsanya and O. Job RT&A, No 1 (72) WEIBULL HALF LAPLACE {EXPONENTIAL} DISTRIBUTION_Volume 18, March 2023

2. DERIVATION OF WEIBULL-HALFLAPLACE {EXPONENTIAL}

DISTRIBUTION

In this section, we investigate in details the properties, parameters estimation, and applications of a new distribution of the T-Half Laplace (7) family called Weibull- Half Laplace {Exponential} (W-HLa{E}) distribution.

Let t be a random variable that follows a two-parameter Weibull distribution, then PDF is given as

(k/t\ '

f/(t;A,k) = -

And the CDF is

FT(t;A,k) = l-e~+'J ;t^0,A,k^0 (5)

Recall Equation (4), and given the quantile function of exponential distribution as -blog[1-x]

Fx(x) = FA-blogil - (Fr(x))]} then the CDF of proposed W-HLa{E} distribution is

Fx(x) = F/ \-blogRl-(l-e+-X-"#,JS} (6)

Substituting the CDF of Weibull distribution in equation (6)

Fx(x) = l - (x~e)k (7)

Let = y in (7), then we have

Fx(x) = l- e~(r)k(x~0)k (8)

Hence the corresponding PDF using equation (8)

fx(x) = k (x - e)0-!e+& (x~8)k (9)

Let -5 = y in (9) or differentiate equation (8) with respect to x, then we have

fx(X) = kY0(x - 0)k-1eyk(x-d)k (10)

where k,6,Y > 0 are parameters of W-HLa{E} distribution

3. STATISTICAL PROPERTIES OF W-HLA{E} DISTRIBUTION

The statistical properties of the W-HLa{E} distribution including quantile function, ordinary moments, and Shannon entropy are provided in this section

Proposition 1 (Quantile Function) If X is a random variable that has W-HLa{E} distribution (x; k, 9, y) and let Qx(p), such that 0 <p < l denote the quantile function for the W-HLa{E} distribution. Then Qx (p) is given by

Qx (P) = e+ 6 (-log(l-p)}^ (11)

where k,6,Y > 0 are parameters of W-HLa{E} distribution Proof:

From Equation (8) replace Fx (x) with p and solve for x, we obtain (11), the quantile function of W-HLa{E} distribution.

Settingp = 0.25,0.50,and0.75 in (30)he quartiles of the W-HLa{E} distribution can be obtained. Lower Quartile

Qx(0.25) = 6 + 6{-log(l - 0.25)}k

Median

Qx(0.5) = d+ 6{-log(l - 0.5)}k

Upper Quartile

A. S. Ogunsanya and O. Job RT&A, No 1 (72) WEIBULL HALF LAPLACE {EXPONENTIAL} DISTRIBUTION_Volume 18, March 2023

Qx(0.75) = 8 + !{-log(1 - 0.75)}%

Proposition 2 (Modal Function) If X is a random variable that has W-HLa{E} distribution^; k,8,y) and let XMoie (x), such that 0 <p < 1 denote the mode function for the W-HLa{E} distribution. Then Qx (p) is given by

Xmoae(x) = d + ! [^f (12)

where k,8,y > 0 are parameters of W-HLa{E} distribution Proof:

Differentiate (10) and equate to zero

d

Txf(x) = 0

d

dx I

-¡¡-[kyk (x-8)k-1e-6k(x-()k-1] = 0

k(k - l)Yk(x - 8)k-1e-6k(x-e) k2y2k(x - e)2k-1e-6k(x-8) _ (x-8) (x-8) =0

k(k - 1)yk(x - e)k-1e-6k(x-6)k-1 _ k2y2k(x - e)2k-1e-6k(x-6)k-1 (x-8) = (x-8)

Solving for x, we have

1

k-1

_ 1

Xmode (x) = 9 + ~

3.1. Shape Properties of W-HLa{E} Distribution

Cumulative Distribution Function (CDF) of W-HLa{E} Distribution. Equation (9) is now the CDF of the new probability distribution called W-HLa{E} Distribution.

3.2 Hazard Function The hazard function of the W-HLa{E} distribution is derived from this definition

hx(x) = fx(x)/1 - Fx(x)

where fx(x) and Fx(x) are the PDF and CDF of W-HLa{E} distribution given in (9) and (10) respectively. The hazard function h(x) can be written as

hx(x) = k6 ( I mk--(13)

Simplifying (13), we have

hx (x) = ky k (x-8)k-1 (14)

The log hazard of W-HLa{E}D, which is frequently used in modeling is given by

Ax (t) = log (hx (x)) Ax(t) = log(kyk(x - 8)k-1) (15)

Expanding equation (15)

Ax(t) = log(kyk) + log(x - 8)k-1 Ax(t) = log(kyk) + (k-1)log(x- 8) (16)

Density plot of WLLED

Empirical CDf OF WLLED

x (a) x (b)

Figure 1: (a) Density plot and (b) CDF plot of WLE distribution for sample size = 1000 and for various values of k, 9 and y.

Figure 1(a) ahows that the PDF is positively skewed. Figure 1(b) shows that the CDF of W-HLa{E}D increases as x increases and remains constant as it approaches 1.

3.3 Survival Function The survival function of the W-HLa{E} distribution is derived from this definition

Sx M = l- Fx 00

where F(x) is the CDF of the W-HLa{E} distribution as defined in equation(9) The survival function Sx can be written as Sx(x) = l - (l- e-yk(x-d)k}

Sx (x) = e-yk(x-e)k (17)

For x > 0,k, dandy > 0 and t > 0, the probability that a system having age x units of time will survive up to x + t units of time is given by

e-yk(x+t-e)k

sx oo =

-yk(x-8)k

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

3.4 Cumulative Hazard Function

The cumulative hazard function, of the W-HLa{E} distribution, is given as

Hx(x) = -loge{e-yk(x-e)k} (18)

Simplifying (18) we have

Hx(x) =yk(x- B)k (19)

3.5 Asymptotic Behavior of W-HLa{E} Distribution

To investigate the asymptotic behavior of the proposed distribution model W-HLa{E}, we find the limit as x —>8 and as x —> ro of the W-HLa{E} distribution

limx^d f(x) = limx^dkyk(x - 8)k-1e-yk(x-e)k = 0 Proof: Since limx^e(x - 8) = 0, then limx^e f(x) = 0

Hence as x tend to a minimum value of the distribution, W-HLa{E} distribution becomes zero Similarly,

Umx^?f(x) = limx^dkyk(x - e)k-1e-yk(x-e)k = 0 Proof: Since limx^? = (e-yk(x-e')k^ = 0, then lim.x^?f(x) = 0

A. S. Ogunsanya and O. Job RT&A, No 1 (72)

WEIBULL HALF LAPLACE {EXPONENTIAL} DISTRIBUTION_Volume 18, March 2023

As x tends to infinity the W-HLa{E} distribution becomes zero

3.6 Moments and Variance

In this subsection, we shall determine rth moment of about the origin and nth moment about the mean. Given

E(X) = f? xfx(x)dx (20)

Proposition 3 (First Moment about Origin) If X is a random variable that has W-HLa{E} distribution(x;k,9,y) and let XM denote the first moment about the origin of the W-HLa{E} distribution. Then is given by

Xm = 8+1r(1+1)

where k,8,y > 0 are parameters of W-HLa{E} distribution Proof: Substituting (10) in (20)

E(X) = f? xkyk(x - e)k-1e-yk(x~8)kdx (21)

Let

z = yk(x- B)k (22)

then differentiate Equation (22) with respect to x

9z

dx = ———%-r (23)

kyk(x-d)k-1 v '

Substitute (23 ) in (21), we have

E(X) = f? xkyk(x - 8)k-1e-yk(x-e) (24)

E(X) = f? xe-vk(x-6)kdz (25)

From (22), we make x the subject,

zi/k

x = e + — (26)

0<z<™ Substitute (22) and (26) in Eqn. (25), we have

E(X) = f? {B+ e~zdz (27)

)/k

E(X) = f? 8e-zdz + f? z— e-zdz (28)

By evaluating the limits, we have

E(X) = -8e-yk(x-Vk I? + 6 r(l+ k) (29)

Hence equation (29) becomes

E(X) = d + 6 r(l+ k) (30)

Proposition 4 (Second Moment about Origin) If X is a random variable that has W-HLa{E} distribution(x; k,9,y) and let n2 denote the second moment about the origin of the W-HLa{E} distribution. Then n2 is given by

^=d+2lr(i+1k)+?" r^+D

where k,8,y > 0 are parameters of W-HLa{E} distribution Proof: Given

E(X2) =ii2 = f? x"fx(x)dx (31)

Substitute equation (10) in equation (31)

¡j" = f? x2kyk(x - d)k-1e-6k(x-e)kdx (32)

Using the above method adopted in proposition 4.4, we have,

»2 = 82 + 2 ( r(k + k)+7 r(k + k) (33)

Corollary 1 (rth moment) Let X be a random variable that follows W-HLa{E} distribution(x; k, 6,y) and let Hr denote the rth moment about the origin of the W-HLa{E} distribution. Then nr is given by

= er + XU Q ^ r(l + k) (34)

Where i=1,2,3,...,r

Proof: By Mathematical induction, it follows from equations (30) and (33) of propositions 3 and 4. respectively

Hence the first four moments of the proposed distribution are given

"i =d+lr{l+l) =e"+2e-r[l+i)+is ril+D

3 9" ( l\ e ( 2\ l ( 3\ = *3 + 3Jr{l+l) + 3T2 r(l+^+:~S r{l + 33) . e3 ( l\ e2 / 2\ b ( 3\ l ( 4\

Again using the relationship between moments about mean and moments about the origin, the moments about the mean of W-HLa{E} distribution are obtained as

Variance(ji2) =E(X2)- [E(X)]2

Variance^) = 62 + ^(l + + 1r (l + 2) -[d + lr (l +

= r(l+ k)-fr(l+ k)f

=! {r(l+2)-[r(l+!)]2}

Therefore the standard deviation is given as

StandardDeviation(X) =

F {r(1+^)-[r(1+1)]2}

Co — efficient of Variation(C. V) = ■

StandardDeviation

E(X)

c.v =

Hi+ih Ki+si)

Again using the relationship between moments about mean and moments about origin where M3 = M3 - 3t!^2 + 2p.! and = - + + are third and fourth moments about

the mean respectively. The moments about the mean of W-HLa{E} distribution are obtained as

= e3 +3 T(l + kW + i r(l + k) + yi r(l + i)-3[» +1 r(l + SH«1 + 2 i r(l + !) +1 r(l + k)] + 2\e + y r(l + !)]3

= + 4 f r(l+ !) + 6 £ r(l + k) + 4 ( r(l + ± r(l + H) -*[* +1 r(l + i)][«3 + 3 r(l + !) + 3 y. r(l + $ r(l+ k)] +6[e + ! r(l + k)]2 [e2 + 2 y r(l + !) + y. r(l + f)] + 3[S + 6 r(l + 0)]h

Hence, the skewness and kurtosis are determined as follows;

Skewness =

Skewness =

!

E(x-J)1

J3

l03 + 3 7 T(1 + 0) + 3 ( TV1 + 0) + ! TV1 + 0)

3ld+yT{1 + k)]ld2 + 2YTi1 + k) + r" T{1 + k)} + 2ld + yT{1 + k) Further simplification of above we have

= ^ = 2r(1 + l)2-3r(1+l)r(1 + j2) + r(1+3)

r(1+k)-[)r(1+1)]2

Y V-4

Kurtosis = —

O4

Kurtosis =

-!-.W + 4 6 r(1+ !) + 6 r(1+ 0) +

^('.a-K^n)' 6 ( 0> "( k)

4 ( r(1+ 0) + r(1+ 4)-4[i,+ 6 r(1+ i)][03 +3 £ r(1 + 0) + 3 ( r(1 + 0) + ! r(1 + 0)] + 6[e + ! r(1 + 0)]2 [o2 + 2 6 r(1+ 0) + ! r(1+ 0)] + 3 [e + ! r(1 + 0)]4)

3.7 Order Statistics

3

Order statistics is an important concept in probability theory. Let a random sample X1,X2,...Xn, from the distribution function F(x) and corresponding pdf f(x), therefore the pdf of ith order statistic is given as

Proposition 5 If X is a random variable that has W-HLa{E} distribution^; k, 8, y) and let f(xE) denote the pdf of ith order statistic which is given as

We) = k9Pyk ^ ^ - kJ kJ (-DP+q (e-6k(x-d'k)1+q+n~l(35)

where k,8,y > 0 are parameters of W-HLa{E} distribution Proof: Given

f(xE) = (i-1"(n-i)! f(x)F(x)i-1[i - F(x)]n-1 (36)

hence the pdf of ith order statistic of W-HLa{E} distribution is determined by substituting equation (7) and (10) in equation (35) we have

f(Xi) = Q kyk(x - 8)k-1e-rk(x-()k [i - e-Yk(x-8)kji-1 [e-Ykix-0)kjn-1 (37)

f(Xi) = (i-1*n-0! kYk X?=0 H - kJ (-1)P 0pe-6k(x-e)k Y?=0 kJ (-1)q (e-6k(x-()k)q (38)

f(x<) = T-Sn-o'. kdPvk H - 1J 1J (-Dp+q (e-6k(x-e)k fq+n~i

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

Therefore the first and nth order statistics for W-HLa{E} distribution can be determined as thus Corollary 2 (nth Order Statistic) Let X be a random variable that follows W-HLa{E} distribution(x; k,8,y) and let f(x±) denote the first (1st) order statistic of the W-HLa{E} distribution. Then f(xx) is given by

f(Xi) = k8pyk Z?qq=0 I* - 1J (-1)S+q (e-6k(x-()k)n+q) (39)

Where i=1,2,3,...,n

Proof: From equation (35), replace i with 1 we have equation (39)

A. S. Ogunsanya and O. Job RT&A, No 1 (72) WEIBULL HALF LAPLACE {EXPONENTIAL} DISTRIBUTION_Volume 18, March 2023

Corollary 3 (nth Order Statistic) Let X be a random variable that follows W-HLa{E} distribution(x; k, 8, y)

and let f(xn) denote the last (nth) order statistic of the W-HLa{E} distribution. Then f(xn) is given by

f(Xi) = n-inqyw. k°PYk (p - lJ (-DP+q (e-yk(x-6)kT") (40)

Where i=1,2,3,...,n

Proof: From equation (35), replace i with n we have equation (40)

3.8 Entropy

In information theory, entropy is an important concept and can be defined as a measure of the randomness or uncertainty associated with a random variable. However, the Shannon entropy for a random variable X with pdf fx(x) is defined as E{-log(fx(x))}

Proposition 6 If X is a random variable that has W-HLa{E} distribution(x; k,9,y) then Shannon's entropy is given as

0.57722

E{-log(fx(x))} = yk -(k-l) (—-J - log(y) - log(k)

where k,8,y > 0 are parameters of W-HLa{E} distribution and O = - logze-zdz « 0.57722 is the Euler gamma constant

Proof: Substitute equation (10) in the definition of entropy we have

E{-log(fx(x))} = E [-log (kyk(x - 8)k-1eyk(x-e)k)} (41)

E{-log(fx(x))} = E{-[log(k) + klog(y) + (k - l)log(x -8)- yk(x - 8)k]}

= E{yk(x - 8)k - [log(k) + klog(y) + (k - l)log(x - 0)]} (42)

Finding the expectation of (x - 8) with the proposed distribution W-HLa{E}, we have,

E((x - 8)k) = I( (x - 8)kkyk(x - B)k-1e-yk(x-6)kdx (43)

Recall Equation (22) and (23)and substitute them in equation (43)

E((x - 8)k) = (x - 8)kkyk(x - 8)k-1e-yk(x-6)k x kyk^e)k-1 (44)

E((x - 8)k) = JQ? (x - 8)ke-yk(x-d)kdz (45)

Substituting Equation (26) in Equation (45) and integrating the expression

E((x - 8)k) = f? zezdz = [-zez - ez]? = l (46)

Hence

E(yk(x-8)k )=y k E((x-8)k) = yk (47)

Also

E((x - 8)k) = f? log(x - 8)kfx(x)dx (48)

Therefore

E(log(x - 8)k) = f? log(x - 8)kkyk(x - 8)k-1e-yk(x-e)kdx (49)

Substitute Equation (22) and (23)and substitute them in equation (49)

E(log(x - 8)k) = f? log(x - 8)kkyk(x - 8)k-1e-yk(x-e)k x (50)

Then equation (50) becomes

= f? log(x - 8)ke-zdz (51)

= f? log(^e-zdz (52)

= j log (zkJ e-zdz + J log (-J e-zdz

= f? log (zk) e-zdz - f? log(y)e-zdz (53)

= f? log (z%)e-zdz - log(y)f? e-zdz (54)

= k f? lo9(z)e-zdz - log(y) f? e-zdz (55) Let O = - f0^ log(z)e-zdz then equation (55) becomes

A. S. Ogunsanya and O. Job RT&A, No 1 (72) WEIBULL HALF LAPLACE {EXPONENTIAL} DISTRIBUTION_Volume 18, March 2023

E(log(x-8)k) = - W -log(Y) (56)

Substitute Equations (47) and (56) in Equation (42), we have

E{-log(fx(x))} = Yk -(k-1){- W - log(Y)) - (log(k) + klog(Y)) (57) E{-log(fx (x))} = Yk + (k-1){W + log(Y)) - log(k) - klog(Y) E{-log(fx(x))} = Yk + (k- 1) (W) - log(Y) - log(k) where k,6,y > 0 are parameters of W-HLa{E} distribution and O = - f™ logze-zdz « 0.57722 is the Euler gamma constant from Abramowitz [1]

E{-log(fx(x))} = Yk + (k-1) (^^p1) - log(Y) - log(k) (58)

3.9 Mean Residual life

The Mean Residual Life (MRL) at a given time t measures the expected remaining life of an individual of age t. it otherwise called the life expectancy.

Proposition 7: Let T be a random variable that follows W-HLa{E} distribution(t, k,y, 8) and let MRL represents mean residual life at a given time t W-HLa{E} distribution. Then MRL is given by

MRL = —-^-%\1r(1+1) + 8e-t -~v{1 +1,t)]-t e-jy)k(t-8)k ly V k) y V k )]

where k,Y,6 > 0 are parameters of t W-HLa{E} distribution

Proof: Given

MRL = ^[E(x) - £ tfx(t)dt\ - t (59)

And Sx(x) = 1-F(x)

From equation (8) and (34)

Fx(t) = 1 - e-T6)k(t-()k, E(X) = 8+^(1+^)

MRL = [e + 6r(1+k)- f° tfx(t)dt)-1 (60)

ft tfx(t)dt = ft (d+ e~ldt

ft ft t1/k

= I 8e-tdt+ I -e-tdt

Jo Jo Y

= -8e-t° + 1^Y{1 + 111,t)

ft tfx(t)dt = 8 - 8e-t + 6r (1 + k,t) (61)

Substituting equations (61) in equation (60)

MRL = i-6)W {e + 1r{1 + /1)- {6-6e-t +1r{1+Pc))}-t

Where r (1 +1 ,t) is an incomplete gamma function of variable t.

MRL = i-6)Wl1r{1 + k) + de-t-^r{1 + k,t] T-t

4. SIMULATION STUDY

The simulation was done using equation (11) which is the quantile function of W-HLa{E}. Let p be a

1 )

uniform random variable on (0,1), then X = 8 + -{-log(l - p)}%, the descriptive summaries were obtained through Statistical software R3.4.4.version.

Table 1: Descriptive summaries of simulation of W-HLa{E} distribution with various parameters

Model Parameters Mean Median Max. Variance skewness kurtosis Cv

J1 k=0.5; y=0.5; 0=5 6.801 5.453 21.337 14.239 3.115 12.309 0.554

J" k=0.5; =0.5;0=10 11.8 10.45 26.34 14.239 3.115 12.309 0.320

Jg k=1; y=3; 0=5 10.22 10.16 10.95 0.055 1.748 5.827 0.023

y4 k=1.5; y=1; 0=5 5.67 5.609 7.014 0.251 1.040 3.751 0.088

ys k=2; y=2; 0=10 10.35 10.34 10.85 0.042 0.612 3.008 0.036

y6 k=2; y=2; 0=5 5.352 5.345 5.845 0.042 0.612 3.008 0.036

y7 k=3; y=1; 0=5 5.76 5.781 6.419 0.097 0.127 2.625 0.054

y8 k=3; y=1; 0=10 10.76 10.78 11.42 0.097 0.127 2.625 0.029

From table 1 it is observed that as parameter k increases the variance, skewness, and kurtosis function decreases for different values of the scale and location parameters of W-HLa{E} distribution hence, the skewness and kurtosis are decreasing functions of k.

When k=2 and y=2, the mean is approximately equal to the median hence, the distribution tends to be symmetric. Table 1 also shows that the coefficient of variation is deeply affected by y. When the y increases positively the value of the coefficient of variation decreases

5. ESTIMATION OF PARAMETERS FOR THE W-HLA{E} DISTRIBUTION

The Maximum Likelihood estimates of W-HLa{E} distribution will be obtained in this section Definition: Let x1,x2,...,xn denote a random sample drawn from W-HLa{E} distribution with parameters k,y,8. The likelihood function l (x, k, y, 8) of W-HLa{E} distribution is defined to be the joint density of the random variables x1, x2,...,xn

l(x, k, y,8) = n kyk(x - 8)k-1eyk(x-e)k (62)

= knynk n (x - 8)k-1e-yk(x-e)k (63) finding the loglikelihood function of equation (63), we have

logl(x, k, y, 8) = \ogkn + \ogynk + Ym log(x - 8)k-1 + Yn - Yk((x - 8)k) (64) where L=logl (x, k,y,8)

L = n\ogk + nklogy + (k - l) Yf \og(x -8)-yk Yf ((x - 6)k) (66)

L = n\ogk + nklogy + (k - l) Yn \og(x -8)-yk Yn ((x - 8)k) (66) Differentiating the log-likelihood function in (66) with respect to the parameters^, y, 8 we have

= f - kyk-1 Yn (Xi -d)k (67)

9kk = n + n\ogy Yn (Xi -8)k -yk Yn (Xi - 8)k\og(Xi - 8) (68)

= kykYn (Xi - e)k-1 -(k-DYn-^ (69)

The maximum likelihood estimates (MLE), k,y,8 for the parameters k,y,8 respectively, are obtained by setting (67) - (69) to zero and solving them simultaneously.

6 APPLICATIONS

In this section, we shall be investing the importance of the new distribution W-HLa{E} distribution and apply it to three real-life data. In these applications, the maximum likelihood estimation method is used in these two applications to estimate the parameters of fitted distributions. The maximized log-likelihood, the Kolmogorov-Smirnov test (K-S) along with the corresponding p-value, the Akaike Information Criterion (AIC), and Bayesian Information Criterion (BIC) are reported to compare the W-HLa{E} distribution with the other distributions.

DATA SET I: Remission times of bladder cancer patients

Remission Time is the time taken for the signs and symptoms of a particular disease, in this case, cancer, to decrease or disappear after treatment. Though cancer may be considered in remission, cancer cells may remain in the body.

Table 2: Remission times (in months) of bladder cancer patients' data

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

0.080 0.200 0.400 0.500 0.510 0.810 0.900 1.050 1.190 1.260 1.350 1.400 1.460 1.760 2.020 2.020 2.070 2.090 2.230 2.260 2.460 2.540 2.620 2.640 2.690 2.690 2.750 2.830 2.870 3.020 3.250 3.310 3.360 3.360 3.480 3.520 3.570 3.640 3.700 3.820 3.880 4.180 4.230 4.260 4.330 4.340 4.400 4.500 4.510 4.870 4.980 5.060 5.090 5.170 5.320 5.320 5.340 5.410 5.410 5.490 5.620 5.710 5.850 6.250 6.540 6.760 6.930 6.940 6.970 7.090 7.260 7.280 7.320 7.390 7.590 7.620 7.630 7.660 7.870 7.930 8.260 8.370 8.530 8.650 8.660 9.020 9.220 9.470 9.740 10.06 10.34 10.66 10.75 11.25 11.64 11.79 11.98 12.02 12.03 12.07 12.63 13.11 13.29 13.80 14.24 14.76 14.77 14.83 15.96 16.62 17.12 17.14 17.36 18.10 19.13 20.28 21.73 22.69 23.63 25.74 25.82 26.31 32.15 34.26 36.66 43.01 46.12 79.05

Table 2 shows data of remission times(in month) of 128 bladder cancer patients selected at random as reported by Lee, et al [12], which was studied by Zea [16] to compare the fits of a different family of beta-Pareto(BP) and beta exponentiated Pareto (BEP) distributions. Almheidat [4] also applied to four parameters Cauchy-Weibull {logistic} (C — W{L}) distribution in fitting this same data and just of recent Aldeni [2] applied uniform-exponential{generalisedlambda} distribution (U — E{GL}) distribution to fit the same data

Table 3: Descriptive Statistics of remission times of bladder cancer patients distributions

Min. Max. Mean 1st Qu. Median 3rd Qu. Skewness Kurtosis SD

0.080 79.050 9.366 3.348 6.395 11.838 3.287 18.483 10.508

Table 4: Performance of the distributions remission times of bladder cancer patients distributions Parameter estimates: Log-likelihood, AIC, and p-value (Standard errors in parentheses)

Distributions WHLa[E} U - E[GL} C - W{L} BEP BP

k = 0.4847 0=0.2757 a=2.3040 a = 0.348 a = 4.805

(0.0267) (0.0665) (1.0937) (0.0970) (0.0550)

y = 0.4850 A3 = 2.504 ß = 2.0205 b =159831 b = 100.502

(0.0651) (0.9285) (0.4585) (183.7501) (0.2510)

y. = 0.0785 Aa = 0.2894 k=3.0673 k= 0.051 k = 0.011

(-) (0.0858) (0.7319) (0.0190) (0.0010)

A = 12.663 ß = 0.080 ß = 0.080

(2.6326) (2.0930)

Table 5: Performance of the distributions remission times of bladder cancer patients distributions Parameter estimates: Log-likelihood, AIC, and p-value

Distributions WHLa[E} U - E{GL} C - W[L} BEP BP

-Loglikelihood 271.3276 409.45 416.0965 432.41 480.446

AIC 548.6552 824.9 840.2 874.819 968.893

K-S 0.0078125 0.02876 0.06672 0.142 0.217

P-value 0.9922 0.9999 0.6189 0.0121 1.11E-05

Table 3 shows the summary of the dataset I and table 4 displaces the parameters estimates of W-HLa{£] and four other distributios. Table 5 shows the values of parameter estimates, log-likelihood, AIC, K-S, and its p-value at 95%. Based on the above test statistics, W-HLa{£] has the least AIC with 548.6552 and K-S Statistic (0.0078125) hence W-HLa{E} performed best among the five distribution models applied to remission time of bladder cancer. This implies that the new distribution can fit skewed data with long-tail better than any distribution. After an appropriate distribution has been identified and parameters estimated, we can estimate the probability of having a given duration of remission and other probabilities. For example, the probability of having a remission time longer than 10 months can be predicted as P(X >x) = e-y%(x-0)% When k = 0.4847, y = 0.4850,0 = 0.0785

P(X > X) = e-O.H850OAS47(10-0.0785)OAS47

P(X > 10) = 0.117

Data Set II: 72 pigs infected by virulent tubercle Bacilli (Bjerkedal, T, 1960)

The data in Table 6 are survival times (in days) of seventy-two pigs infected by virulent tubercle bacilli [9] The data in Table 6 are survival times (in days) of seventy-two pigs infected by virulent tubercle bacilli reported by Tahir [15] The Tables 4.4 shows the performance of W-HLa{E} and three other models (Logistic Frechet (LFr), Marshall-Olkin Frechet (MOFr), exponentiated-Frechet (EFr) and Frechet (Fr). Table 6: Infected Pigs data (in day) 10 33 44 56 59 72 74 77 92 93 96 100 100 102 105 107 107 108 108 108 109 112 113 115 116 120 121 122 122 124 130 134 136 139 144 146 153 159 160 163 163 168 171 172 176 183 195 196 197 202 213 215 216 222 230 231 240 245 251 253 254 254 278 293 327 342 347 361 402 432 458 555

Table 7: Descriptive Statistics of Infected Pigs data

Min. Max. Mean 1st Qu. Median 3rd Qu. Skewness Kurtosis SD

43.00 598.00 141.85 82.75 102.50 149.25 2.5153 9.332 109.209

In this application, we obtain the descriptive statistics, maximum likelihood estimates of the parameters of the fitted distributions, and the values of the following statistics: AIC (Akaike Information Criterion), BIC (Bayesian Information Criterion), and HQIC (Hanna Quinn Information Criterion).

In addition, we compute some goodness of fit statistics to verify which distribution provides the best fit to the data sets. We apply Kolmogorov-Smirnov (K-S) statistics. These statistics are described in detail in Tables 8 and 9. In general, the smaller the value of these statistics, the better the fit of the data by the distribution.

Table 8: Parameter Estimates and Standard errors in parentheses for W-HLa{E} Distribution

Distributions WHLa{E} LFr MOFr EFr

6 = 42.99956 A=32.5054 a=212.7251 a= 155.680

(16.8814) (0.0665) (115.7064) (5.7063)

Parameter y = 0.4850 A3 = 2.504 P = 2.0205 b = 159831

Estimates (0.0651) (0.9285) (0.4585) (183.7501)

p = 0.0785 XA = 0.2894 k=3.0673 k= 0.051

(-) (0.0858) (0.7319) (0.0190)

A = 12.663 P = 0.080

(2.6326) (2.0930)

skewness = 1.841076 and kurtosis = 7.49277

Table 9: Log-likelihood, AIC, BIC, HQIC, KS Statistic and p-value of 72 pigs infected by virulent tubercle Bacilli of different distributions

Distributions WHLa{E} LFr MOFr EFr

-Loglikelihood 4103371 426.2306 426.6764 449.7452

AIC 826.6741 858.4612 859.3527 903.4903

BIC 833.5041 865.2912 866.1827 908.0437

HQIC 829.3932 861.1803 866.1827 905.3030

K-S 0.027778 0.0695 0.1213 0.0923

P-value 0.9460 0.8773 0.2403 0.5710

From table 9, the new proposed W-HLa{E} model corresponds to the lowest values of the loglikelihood, AIC, BIC, HQIC, and K-S statistics among the fitted LFr, MOFr, and EFr models and therefore the W-HLa{E} model can be chosen as the best for the data set above. The distribution with the lowest Akaike Information Criteria (AIC) or BIC and the lowest Log-likelihood value is declared as best fit" distribution. In this case, W-HLa{E} distribution has the lowest Log-likelihood of -410.3371 with the lowest corresponding lowest AIC value of 826.6741. Hence, W-HLa{E}D is regarded as a best-fit model for this particular data used.

7. CONCLUSION

The research work introduced a new probability distribution called Weibull Half Laplace exponential distribution. Expressions for the probability density function, cumulative distribution function, survival function, and hazard function, and cumulative hazard function of the proposed distribution are derived. Some properties of the proposed distribution such as moments, order of statistics, and Shannon entropy have been studied. The simulation also supported the theoretical expression of the statistical properties of the proposed distribution such as the location parameter does not affect the variance, skewness, and kurtosis of the new distribution. From table 1, when k=2 and y=2, the mean is approximately equal to the median hence, the distribution tends to be symmetric. The maximum likelihood method is adopted to estimate the parameters of the distribution. Coefficient of variation is deeply affected by y. When the y increases positively, the value of the coefficient of variation decreases It is shown, by means of two real data sets. Two life data were applied to the new distribution and others established distributions by researchers in the field of probability distribution and we found out that W-HLa{E} distribution has the lowest log-likelihood and the smallest AIC, BIC, and HQIC for the two data sets in tables 6 and 9. W-HLa{E} distribution is a better fit for the two data sets.

References

[1] Abramowitz, M, Stegun, IA (1964). Handbook of mathematical functions with formulas, Graphs and mathematical tables, Dover Publication, Inc., New York.

[2] Aldeni, M., Lee, C. and Famoye, F. (2017). Families of distributions arising from the quantile of Generalized lambda distribution. Journal of Statistical Distributions and Applications.

[3] Aljarrah, MA, Lee, C. and Famoye, F (2014): On generating T-X family of distributions using quantile functions. J. Stat. Distrib. Appl. 1, 1-17

[4] Almheidat, M., Famoye, F. and Lee, C. (2015). Some Generalized Families of Weibull Distribution: Properties and Applications, International Journal of Statistics and Probability;4(3). 18-35.

[5] Alzaatreh, A., Lee, C., Famoye, F. (2013). A new method for generating families of continuous distributions, METRON 71,63-79. DOI 10.1007/s40300-013-0007-y

[6] Alzaatreh, A., Lee, C. and Famoye, F. (2014). T-normal family of distributions: a new approach to generalize the normal distribution. J. Stat. Distrib. Appl. 1, pp. 1-16.

[7] Alzaatreh, A, Lee, C. and Famoye, F. (2015). Family of generalized gamma distributions: properties and applications. To appear in Hacettepe Journal of Mathematics and Statistics

[8] Amalare, A. A., Ogunsanya, A. S. Ekum, M. I. and Owolabi, T. (2020). Lomax Cauchy{Uniform} Distribution: Properties and Application to Exceedances of Flood Peaks of Wheaton River, Benin Journal of Statistics, 3, 66- 81.

[9] Bjerkedal, T (1960): Acquisition of Resistance in Guinea Pigs Infected with Different Doses of Virulent Tubercle Bacilli. Am. J. Epidemiol. 72, 130-148.

[10] Job, O. and Ogunsanya, A. S. (2022). Weibull Log Logistic {Exponential} Distribution: Some Properties and Application to Survival Data. International Journal of Statistical Distributions and Applications. 8(1), 1-13. doi: 10.11648/j.ijsd.20220801.11

[11] Kotz, S. Kozubowski, T. J. and Podgorski, K. (2001): The Laplace Distribution and Generalizations A Revisit with New Applications, Springer, ISBN: 0-8176-4166-1

[12] Lee, E.T. and Wang, J.W. (2003) Statistical Methods for Survival Data Analysis. John Wiley: New York

[13] Ogunsanya, A. S., Akarawak, E. E. and Ekum, M. I. (2021a) On some properties of Rayleigh Cauchy distribution, Journal of Statistics and Management Systems, DOI: 10.1080/09720510.2020.1822499

[14] Ogunsanya, A.S., Yahya, W. B., Adegoke, T. M., Iluno, C., Aderele, O.R. and Ekum M. I. (2021b). A New Three-parameter Weibull Inverse Rayleigh Distribution: Theoretical Development and Applications. Mathematics and Statistics, 9(3), 249 - 272. DOI:10.13189/ms.2021.090306.

[15] Tahir, M. H., Mansoor, M. and Zubair, M. (2014). McDonald Log-Logistic Distribution with an Application to Breast Cancer Data. Journal of Statistical Theory and Applications. 13(1): 65-82

[16] Zea, L.M., Silva, R.B., Bourguignon, M., Santos, A.M & Cordeiro, G.M (2012): The Beta Exponentiated Pareto distribution with application to bladder cancer susceptibility. International Journal of Statistics and Probability. 1(2), 8-21.

i Надоели баннеры? Вы всегда можете отключить рекламу.