Научная статья на тему 'ON THE PROPERTIES OF GENERALIZED RAYLEIGH DISTRIBUTION WITH APPLICATIONS'

ON THE PROPERTIES OF GENERALIZED RAYLEIGH DISTRIBUTION WITH APPLICATIONS Текст научной статьи по специальности «Науки о Земле и смежные экологические науки»

CC BY
59
32
i Надоели баннеры? Вы всегда можете отключить рекламу.
Ключевые слова
flexibility / transect line / myelogeneous leukemia / odd Lindley Rayleigh / quadratic rank transmutation technique

Аннотация научной статьи по наукам о Земле и смежным экологическим наукам, автор научной работы — Sule Omeiza Bashiru, Ibrahim Ismaila Itopa, Alhaji Modu Isa

In this study, a new three-parameter lifetime distribution called the generalized Rayleigh distribution was introduced. The new model is an extension of classical Rayleigh distribution. An extension of density of the generalized Rayleigh distribution was derived from which some of the statistical and mathematical properties were derived. Some mathematical properties of the distribution were presented such as moments, moment generating function, quantile function, survival function, hazard function, reversed hazard function and odd function. The distribution of order statistic was obtained in which the maximum and minimum order statistics were derived. Estimation of the parameters by maximum likelihood method was discussed. Two real-life application of the distribution was presented and the analysis showed the fit and flexibility of the generalized Rayleigh distribution over odd Lindley Rayleigh distribution and Rayleigh distribution. The analysis showed that the generalized Rayleigh distribution is more effective and robust in fitting the data sets.

i Надоели баннеры? Вы всегда можете отключить рекламу.
iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.
i Надоели баннеры? Вы всегда можете отключить рекламу.

Текст научной работы на тему «ON THE PROPERTIES OF GENERALIZED RAYLEIGH DISTRIBUTION WITH APPLICATIONS»

ON THE PROPERTIES OF GENERALIZED RAYLEIGH DISTRIBUTION WITH APPLICATIONS

Sule Omeiza Bashiru1, Ibrahim Ismaila Itopa2 & Alhaji Modu Isa3

^Department of Mathematical Sciences, Prince Abubakar Audu University, Anyigba, Kogi State,

Nigeria

3Department of Mathematics and Computer Science, Borno State University, Maiduguri, Nigeria Email: [email protected], [email protected], [email protected]

Abstract

In this study, a new three-parameter lifetime distribution called the generalized Rayleigh distribution was introduced. The new model is an extension of classical Rayleigh distribution. An extension of density of the generalized Rayleigh distribution was derived from which some of the statistical and mathematical properties were derived. Some mathematical properties of the distribution were presented such as moments, moment generating function, quantile function, survival function, hazard function, reversed hazard function and odd function. The distribution of order statistic was obtained in which the maximum and minimum order statistics were derived. Estimation of the parameters by maximum likelihood method was discussed. Two real-life application of the distribution was presented and the analysis showed the fit and flexibility of the generalized Rayleigh distribution over odd Lindley Rayleigh distribution and Rayleigh distribution. The analysis showed that the generalized Rayleigh distribution is more effective and robust in fitting the data sets.

Keywords: flexibility, transect line, myelogeneous leukemia, odd Lindley

Rayleigh, quadratic rank transmutation technique

I. Introduction

Over a long period of time, probability distributions have been established through mathematical and statistical study. The assumed probability model or distributions have a significant impact on the effectiveness of the processes used in a statistical study. As a result, numerous common probability distributions and pertinent statistical techniques are described in the literature. However, there are still a number of issues where the real data set does not fit into any of the conventional or classical probability models. In literature numerous generalized distributions have been developed with common feature of having more parameters. Induction of parameters in existing distribution improves the goodness of fit of the distribution under study and tail properties of a distribution increases.

The Rayleigh distribution is one of the most widely applied probability distributions. The Rayleigh distribution is a special case of Weibull distribution, which was first described in [1]. In areas including project effort loading modeling, survival and reliability analysis, communication theory, physical sciences, technology, diagnostic imaging, applied statistics, and clinical research, the Rayleigh distribution is crucial for modeling and interpreting life-time data. Numerous researchers have developed significant expansions to the Rayleigh distribution in light of its significance and the need to provide this distribution more flexibility.

Numerous expansions of Raleigh distribution have been produced as a result of the significance of Raleigh distribution in numerous disciplines. The generalized Raleigh distribution was proposed

by [2], and its unknown parameters were determined using several estimation techniques. The novel generalization of Rayleigh distribution was developed by [3] by utilizing the conservability approach. The Bayes estimators for the parameter of the Rayleigh distribution using square error and LINEX loss functions were derived by [4]. The transmuted Rayleigh distribution was developed by [5] using the quadratic rank transmutation technique. The transmuted generalized Rayleigh distribution was proposed by [6]. The Weibull Rayleigh distribution was studied by [7]. The parameters estimation of exponentiated Rayleigh based on type II censored data was deliberated by [8]. A new distribution named as Rayleigh-Rayleigh distribution was derived by [9] and motivated by the transformed transformer technique by [10]. In contrast to the Lindley distribution, the Rayleigh distribution, and other generalizations of the Rayleigh distribution, an extension of the Rayleigh distribution was developed by [11] with two parameters having greater flexibility. An extension of the exponentiated Rayleigh distribution known as the Gompertz-exponentiated Rayleigh distribution was proposed by [12] by utilizing the Transformed-Transformer family of distributions' methodology.

This paper proposes a new distribution that generalizes the Rayleigh distribution using the family of distribution proposed by [13]. This motivation behind this work is to improve the flexibility of the Rayleigh distribution to fit varieties of real life data sets arising from different disciplines including unimodal and bimodal shapes. Also, to make the kurtosis more flexible compared to the baseline Rayleigh model, to produce skewness for symmetrical distributions using type I half-logistic family of distributions derived by [13] and bathtub shapes.

II. Methods

2.1 Generalized Rayleigh (GRa) Distribution

In this section, a new continuous probability distribution function known as generalized Rayleigh distribution is derived. Also, some plots of its pdf, cdf, survival function and hazard rate function (hrf) are shown in order to assess the shape of the new distribution in fitting different kinds of data.

Recently, [13] developed the Type I Half-logistic family of distributions with cdf and pdf

given as

1 -

F(x;A,a,P) =-

1 -[H(x;P)]

1 +

f(x;A,a,P) = -

1 -[H(x;p)]a 2Aah(x;p) [H(x;p)]a-1 1 -[H(x;p)]a

1 +

1 -[H(x;P)]a

(1)

(2)

where P is the vector of parameters of the baseline distribution.

where G(x;P) is the cumulative distribution function (cdf) of the baseline distribution with vector of parameter P .

For x > 0,a,A,p > 0, where equations (1) and (2) are the cumulative distribution function and

probability density function (pdf) of the family of distributions.

The cdf and pdf of the Rayleigh (Ra) distribution are given respectively as

H(x;0) = 1 -e

{% )x

(3)

h( x;d) = dxe

(4)

A

A

a-1

To obtain the cdf of the new model, equation (3) is inserted into equation (1) as

1 -

F(x;1,a,0) = -

1 -

1 -e^ K

1 +

1 -

1 -e 1/2'

(5)

On differentiating equation (5) with respect to x, the pdf of the GRa distribution is obtained which is given as

f(x;1,a,9) =

- - 1-1

r --1 a- r -(-1 a

22 xe 1/2; 1 -e - -1 1- 1 -e - -1

(6)

Figure 1: Plots of cdf of the GRa distribution for different parameter values

i

i

X

Figure 2: Plots of pdf of the GRa distribution for different parameter values where x > 0,6> 0 is the scale parameter and a,l> 0 are the shape parameters respectively.

2.1.1 Expansion of density

In this section the pdf in equation (6) is expanded using binomial expansion. Expanding the last term in equation (6), we have

f(x-,X,a,6) = 2Xa0xe

1 - e-V X

1 -

1 - e

1 +

1 -

1 -e

1 +

1 -

1 -

1 -e

-V X

'1 + ^

œ

1 -

1 -eVX

œ

=Z(-1)'

/=1

X(i +1)-1

1 -e

1 e

(

1 -e-X

z(/+1)-1

œ ,

a( / +1)-1 k

\

v X2

œ

f(x;A,a,0) = 2Aa0x £ (-1)

i+/+k

+ iYx(i +1)-1

i,/,k=0

Va( / +1)-1 k

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

-v )x

(7)

Equation (7) is the expansion of equation (6) which will be used to derive some of the properties of the distribution.

Also, equation (5) is expanded as

[F(x;X,a,V)]h =

1 -e-V )

1 -

1 -

1-

1-

1 -e-(Vi X

=s (-1)"

1 +

1-

1 -e^ )x

h

m

1-

1 -e-V )x

X -h ' h + p-1 ^

1 + 1 - "1 -e-vK a = X (-1)P P=0 P V 1 - "1 -e-vX a

Xp

1 -

1

X(p+m)

œ

=I(-1)"

X(p + m)

1 e

1 e )x

œ

=X(-1 )'

q=0

№ )x

a-1

Œ

V

X

X i+11-1

k=0

a

u.

h

( u\(

h h œ [F(x;A,a,0)] = ("1)'

m,p=0 z,q=0

m

h + p" 1 P

V

l(p + m)

Vaz^

1% K

(8)

Equation (8) is the expansion of equation (5) which will be used to derive some of the properties of the distribution.

2.1.2 Properties of the Generalized Rayleigh distribution

In this section, some of the mathematical and statistical properties of GRa distribution such as the quantile function, moments, moment generating function, reliability measure, odds function, reversed hazard function and order statistics are derived.

2.1.2.1

Moments

q

E(xr) = Jxrf(x)dx

= 2Xad £ ("1 )

i+j+k

i,j,k=0

1 + i

l(l +1)" 1

a( j +1)" 1

k

№ x

dx

x

r leA 21 k+1

e1%)x dx = 2

e e(k+1)

r| 1 + r

fi , ;V

E[xr) = 21a(k +1) £ ("1)

i,j,k=0

i+j+k

1

<Va{ j +1)" k

1 + Hi x(i +1)" 1 "I /-1-11-1 _

2

j

e[k+1)

2 f \

1+r

V 2,

(9)

(10)

0

0

0

Equation (10) is the moments of GRa distribution. To obtain the mean, we set r = 1 in equation (10).

2.1.2.2 Moment generating function (mgf)

Mix)(t) = ]etxf (x)dx

(11)

since the series expansion for etx is given as

w!

Then, following the method of moments, the mgf is obtained as follows

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

'1 + iYl(i +1)- 1Ya( j +1)-11

M(xM) = 22a(k +1)£-! £ I"1)

w=0 w ! 1 ,j ,k=

1+j+k

j

e(k+1)

2 / \ 1 w 1+—

v 2,

(12)

0

tx

e

»■=0

w

2.1.2.3 Quantile function

Quantile function has a significant position in probability theory and it is the inverse of the cdf. The quantile function is obtained using

Q(u) = F\u) (13)

Using the inverse of equation (5), we have the quantile function of GRa distribution given as

x

-QUO-1

- log

1 -

1 -

1 -u u +1

The median is obtained by setting u = 0.5 in equation (14) given as

Xmedian Q ( 0,5 ) q

- log

1 -

1 -

05 1,5

(14)

(15)

2.1.2.4 Hazard function

Hazard function is given as

f i a\ f(x;a,l,/

n(x;a,A,d) -—--

S(x-,a,l/)

The hazard function of the GRa distribution is given as

(16)

n(x-,a,X,d) -

2ia/xe

2 x

1-e

/2 )x2

1

1-e" /

r

r -(^)x2 ] a

1+ 1- 1-e 1/2)

1

1

1 +

1

1

1-e 1/2)

1-e

2 X

(17)

i-1

Figure 3: Plots of hazard function of the GRa distribution for different parameter values

2.1.2.5

Survival function

The reliability function is also known as survival function, which is the probability of an item not failing prior to some time. It can be defined as S(x;a,1,6) = 1 - F(x;a,1,6)

(18)

The survival function of the GRa distribution is given as

S(x;a,1,6) = 1 -

Figure 4: Plots of survival function of the GRa distribution for different parameter values

2.1.2.6

Reversed hazard function

Reversed hazard function of a random variable x is given as ojf i a\ f(x;a,l,/ F(x;ai/)

The reverse hazard rate function of the GRa distribution is given as

M(x;a,A/) =

2Aa/xe 1/2'

1 -e^ >

1 -

1 -e^ >

I-

-//V a

1 + 1 - 1 -e 1/2)

1 -

1 +

1 -

1 -

1 -e-V)

1 -e-V )

(20)

(21)

2.1.2.7 Odds function

The odds function of the GRa distribution is given as

n(x;a,A,0) = -

1 -

1 -

1 -e

1 -

1 -e-V)

1-

1 -

1 -

1-e

1 - 1 -e

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

(22)

2.2 Order Statistics

Let X ,X ,,,,,X be n independent random variable from the GRa distribution and let X ^X ~",~be their corresponding order statistic. Let Frn(x) and frn(x), r- 1,2,3,,,,n denote the cdf and pdf of the rth order statistics Xrji respectively. The pdf of the rth order statistics of X is given as

n- r

f (x) =-^-g(- 1)v

r:nA ' B(r,n-r + 1)fV '

F(x)v

(23)

The pdf of rth order statistic for the new distribution is obtained by replacing h with v+r-1 in

i-1

equation (8) as

f (x) = 22a0x

r.nv '

1

B(r,n - r+1) v=o ¡J,,

n-r w v+r-1 w

i 11 i (-1

v=0 i,j,k=0m,p=0z,q=(

i+ j+k+m+ p+z+q+v

V+r -1

m

v+r + p-2 P

V

V A

X(p+m)

A

-("2 X

n-r v

V /

k+q+1

1+i

X(i+lj-1

a{ j +1j-1 k

\

A /

The pdf of minimum order statistic of the GRa distribution is obtained by setting r=1

n-1 w v w

f.a(x) = 2Xadxn! £ £ I(-1j

i+J+k+m+p+z+q+v

v

v=0 i ,j k=0 m ,p=0 z ,q=0

i.Aiv + p_1 Yx(p + mj p

(1 + i

0 )x2

V /V /

k+q+1

X(i+1)-1

Y

a ( j +1)-1 k

\

V / V /V / V /

Also, the pdf of maximum order statistic of the distribution is obtained by setting r = n

(24)

(25)

v+n-1 w w

/Jx) = 22 adxn I I I(-1)

i+j+k+m+p+z+q+v

m,p=0 i,j,k=0 z,q=0

1 + i

v + n-1

m

v+n+p-2 p

V /V

X(p + m)

V

az

q

X(i +1)-1

a (j +1)-1 k

k+q+1

(26)

2.3 Estimation method

The method of maximum likelihood estimation (MLE) is used in this section to estimate the parameters of the GRa distribution. For a random sample, X ,X ,...,X of size n from the GRa

distribution(a,0,2), the log-likelihood function L(a,0,2) of (6) is given as

0 n

log (l) = nlog (2) + nlog (X) + nlog ( a ) + nlog (0) - "IX +( a - 1)Ilog

2 1

-(2- 1)£log

1 -

1 -e

- 2llog

1 +

1-e

1-

1-e

(27)

Differentiating the log-likelihood with respect to X, a ,0 and equating the result to zero, we have

L-l Vl

d2~2 + i og

r "A v21 a

1 - 1 -e0 )x'

- 21

i a A 9 a X in A ->

1 - 1 -e" log 1 -e°2

L J L J

= 0

1+

r -(^)x2 1 a

1 - 1 -el/2''

(28)

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

X

SL n

= _ + £ log

Sa a 21

1 X

-(a-1)£

1 -e [/2) '

log

-r/X2

1 -e (/2) '

1 -

1 - e-V X

ex

/=1

r -OX21 a A-1 r -OX21 a r -OX21

1 - 1 -e i/2) 2 1 -e i/2) 2 log 1 -e i/2' 2

_ _ J _ _ J _ J

= 0

1+

1 -

1 -e-V X

SL n

№ X (o,

=n-1 Ex;-m^.

SO o 2%

2 m №+, 1

1 -eO

2i 122X

1-

- 2a^E

r n A-1

-OX2 a -OX2

1 - 1 -e ''2>' 1 -e ''2)/

_ J _ J

1 - e O'2 )x;

W*2( 122

- = 0

1+

1-

1 -e-O2 x

(29)

(30)

Now, equations (28), (29) and (30) do not have a simple analytical form and are therefore not tractable. As a result, we have to resort to non-linear estimation of the parameters using iterative method.

II. Results

n

A

a

/=1

x

e

A

/=1

3.1 Applications

In this section, we present two applications of GRa distribution using different data sets from different fields to demonstrate the flexibility of the distribution in modeling real-life data sets. The data are fitted to the GRa distribution and two other distributions as comparators such as Odd Lindley Rayleigh (OLRa) distribution by [11] and Rayleigh (Ra) distribution. This is done to test the new distribution's flexibility against the comparators. Adequacy Model which is a package in R-software, is used to produce the results of the analysis. Using the Akaike information criterion (AIC) and Bayesian information criterion (BIC), respectively, the performance of the distribution was compared to other existing distributions that were consistent with the baseline distribution in terms of providing good parametric fit to the data sets.

AIC = -211 + 2k (31)

BIC = -211 + klog(n) (32)

The model selection is carried out using the AIC and the BIC. Where ll denotes the log-likelihood function evaluated at the maximum likelihood estimates, k is the number of parameters, and n is the sample size from the data.

The model with minimum value of AIC and BIC is chosen as the best model to fit the data set. The comparators presented are odd Lindley Rayleigh (OLRa) distribution and Rayleigh (Ra) distribution.

The first data set, taken from [14] and also reported in [15], shows the locations of the 68 stakes found while walking L = 1000 m and looking w = 20 m on either side of the transect line. The dimensions are:

2.0, 0.5, 10.4, 3.6, 0.9, 1.0, 3.4, 2.9, 8.2, 6.5, 5.7, 3.0, 4.0, 0.1, 11.8, 14.2, 2.4, 1.6, 13.3, 6.5, 8.3, 4.9, 1.5,

18.6, 0.4, 0.4, 0.2, 11.6, 3.2, 7.1, 10.7, 3.9, 6.1, 6.4, 3.8, 15.2, 3.5, 3.1, 7.9, 18.2, 10.1, 4.4, 1.3, 13.7, 6.3, 3.6,

9.0, 7.7, 4.9, 9.1, 3.3, 8.5, 6.1, 0.4, 9.3, 0.5, 1.2, 1.7, 4.5, 3.1, 3.1, 6.6, 4.4, 5.0, 3.2, 7.7, 18.2, 4.1.

The second data set represents the survival times (in weeks) of 33 patients suffering from acute

myelogeneous leukemia. These data have been studied by [16]. The data are:

65, 156, 100, 134, 16, 108, 121, 4, 39, 143, 56, 26, 22, 1, 1, 5, 65, 56, 65, 17, 7, 16, 22, 3, 4, 2, 3, 8, 4, 3, 30,

4, 43.

Table 1: The MLEs and Information Criteria of the models based on the first data set

Model cc i ê -l AIC BIC

GRa 0.4774 6.3091 0.0018 185.7156 377.4313 384.0898

OLRa 1.2608 - 0.0114 364.0566 732.1132 736.5522

Ra - - 0.0362 202.4675 406.1545 409.1545

GRa

OLRa

Ra

O 5 10 15 20

Data

Figure 5: Histogram and fitted pdfs for the GRa, OLRa and Ra models to the first data set Table 2: The MLEs and Information Criteria of the models based on the second data set

Model cc i ê -l AIC BIC

GRa 0.2293 0.6271 0.0005 158.1467 322.2934 326.7829

OLRa 1.2857 - 0.0001 270.2955 544.5910 547.5840

Ra - - 0.0005 188.6356 379.2713 380.7678

Figure 6: Histogram and fitted pdfs for the GRa, OLRa and Ra models to the second data set

IV. Discussion

The estimated values for each parameter and the models' goodness of fits are shown in Tables 1 and 2. AIC and BIC are two metrics for goodness of fits. The model performs better when the AIC and BIC values are lower. Tables 1 and 2 show that the GRa distribution has the lowest AIC and BIC, respectively and this makes the GRa model more adaptable and suitable for handling the data sets.

The new model's forms, fit, and adaptability in connection to the data sets under consideration are shown in Figures 5 and Figure 6. The black line, which represents the GRa model, more closely matched the data's pattern than the competitors. The histogram and fitted plots make it clear that the black line, which represents the GRa distribution, matches the two data sets under consideration better.

This study generalized the Rayleigh distribution by deriving a new continuous distribution known as the generalized Rayleigh distribution. Some of the statistical and mathematical properties of the GRa are obtained such as the survival function, hazard rate function, quantile function, inverted hazard function, odds function, and order statistics from the new distribution. Plotting the pdf and hazard rate function graphs revealed the contours of the suggested distribution. It was discovered that the hazard function is shaped like bathtub. AdequacyModel package in R was used to estimate the model parameters using the maximum likelihood method. The generalized Rayleigh distribution and its comparators considered were applied to two real life data sets, and the outcomes are shown in Tables 1 and 2. The findings demonstrated that the GRa distribution is much more potent, robust and superior at fitting the two data sets under consideration. The density graphs in figures 5 and 6 for the two data sets further show how adaptable and robust the new model is.

References

[1] Lord Rayleigh, F. R. S. (1880). On the Resultant of a Large Number of Vibrations of the same Pitch of Arbitrary Phase. The London, Edinburgh and Dublin Philosophical Magazine and journal of Science, 10, 73-78

[2] Kundu, D., and Raqab, M. Z. (2005). Generalized Rayleigh distribution: Different methods of estimations. Computational Statistics and Data Analysis, 49, 187-200.

[3] Voda, V. G. (2007). A new generalization of Rayleigh distribution. Reliability: Theory and Applications, 2(2): 1-10

[4] Dey, S. (2009). Comparison of Bayes estimators of the parameter and reliability function for Rayleigh distribution under different loss functions. Malaysian Journal of Mathematical Sciences, 3(2): 249-266.

[5] Merovci, F. (2013). Transmuted Rayleigh distribution. Austrian Journal of Statistics, 42, 2131.

[6] Merovci, F. (2014). Transmuted generalized Rayleigh distribution. Journal of Statistics Applications and Probability, 3(1): 9-20.

[7] Merovci, F. and Elbatal, I. (2015). Weibull Rayleigh Distribution: Theory and Applications, Applied Mathematics and Information Sciences, 9(4): 2127-2137.

[8] Mahmoud, M., and Ghazal, M. (2017). Estimations from the exponentiated Rayleigh distribution based on generalized Type-II hybrid censored data. Journal of the Egyptian Mathematical Society, 25, 71-78.

[9] Ateeq, K., Qasim, T. B. and Alvi, A. R. (2019). An extension of Rayleigh distribution and applications, Cogent Mathematics and Statistics, 6:1.

[10] Alzaatreh, A., Lee, C., and Famoye, F. (2013). A new method for generating families of continuous distributions, Metron, 71(1), 63-79.

[11] Ieren, T. G., Abdulkadir, S. S. and Issa, A. A. (2020). Odd Lindley-Rayleigh Distribution: Its Properties and Applications to Simulated and Real Life Datasets, Journal of Advances in Mathematics and Computer Science, 35(1): 63-88.

[12] Abdulsalam, H. A., Yahaya, A. and Dikko, H. G. (2021). On the properties and applications of a new extension of exponentiated Rayleigh distribution, FUDMA Journal of Sciences, 5(2): 377-398.

[13] Bello, O. A., Doguwa, S. I., Yahaya, A. and Jibril, H. M. (2020) A type I half Logistic exponentiated-G family of distributions: Properties and application, Communication in Physical Sciences, 7(3): 147 - 163.

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

[14] Patil, G. P., Patil, G. P., and Rao, C. R. (1994). Handbook of statistics 12: Environmental statistics

[15] Almetwally, E. M and Meraou, M. A. (2022). Application of Environmental Data with New Extension of Nadarajah-Haghighi Distribution, Computational Journal of Mathematical and Statistical Sciences, 1(1): 26-41.

[16] Feigl, P., and Zelen, M. (1965). Estimation of exponential probabilities with concomitant information, Biometrics; 21:826-38.

i Надоели баннеры? Вы всегда можете отключить рекламу.