THE ROLE OF RECORD VALUES IN STATISTICAL INFERENCE: A REVIEW ARTICLE
Mahmoud A. Selim Alsanea
Applied College, King Khalid University, Saudi Arabia & Department of Statistics, Faculty of Commerce, Al-Azhar University, Egypt
Selim. [email protected]
Abstract
The record values data have received the attention of researchers in statistics for over seven decades. Through these decades the records have played a significant and widely utilized role for statistical inference in parameter estimation, predicting future values, hypothesis tests, as well as stress-strength tests, and characterizing distributions. In this paper, the types of record values, some distributional properties, and statistical inferences of record values and their applications are reviewed. The purpose of this paper is to shed light on the role of record values in statistical inference. Therefore, we will examine this issue from two perspectives, the first perspective being estimation and the second perspective being prediction. These are through some of the most important lifetime distributions are Exponential, Weibull, Gumbel, Geometric, Pareto, Generalized exponential, Rayleigh, Lomax, and Nadarajah-Haghighi distributions. I hope that the findings of this paper will be useful for researchers in various fields and lead to further enhancement of research in record values theory and its applications.
Keywords: maximum likelihood estimation; maximum likelihood predictor; record values; point prediction; probability distribution; Bayesian estimation; Bayesian prediction
1. Introduction
In statistics, a record value or record statistic is the largest or smallest value obtained from a sequence of random variables. Record values arise naturally in both theoretical and practical areas of probability and statistics. On the practical side, Record values are of interest and importance in several branches of studies such as, hydrology, seismology, psychology, medicine, engineering. All of us constantly hear of new records being created in events such as stock market prices, rainfall, temperature, flood level, athletic events, oil, and mining surveys etc. In any field, whenever a new high or a new low value is observed, in connection with the phenomena under study, it becomes a part of history and will be called as a record. On the theoretical side, various statistical inference procedures such as point or interval estimation and prediction as well as hypothesis testing can be developed based on observed record sequences.
Records become extremely important and necessary in some cases, including when we only want to study the value of the events that exceed the previous ones, or when observations are destroyed by experimental tests, or it is impossible to obtain a complete sample. Overall, records can be useful in any situation where there is a need to track and analyze data over time.
Mahmoud Ali Selim RT&A, No 1 (77) THE ROLE OF RECORD VALUES IN STATISTICAL INFERENCE_Volume I9, March 2024
The aim of this study to shed light on the role of record values in statistical inference. The rest of this paper reviews the records from the following aspects: Section 2 introduces the definition of record values and their distributional properties. Section 3 introduces a literature review on uses of record values in statistical inference. Section 4 reviews the record values data in practical applications. Section 5 reviews the computer software for records. Section 6 discusses the future works. Section 7 introduces some conclusions.
2. Definition of Record Values and their Distributional Properties
The formal study of record value theory probably started with the pioneering paper by Chandler [1]. In this section, the definition of upper and lower and k-th record values and their distributional properties are introduced.
2.1 Upper Record Values
Let xl,x2, ... ... be a sequence of independent and identically distributed (iid) random variables that have cumulative distribution function (cdf) F(x) and probability density function (pdf) /(x). Let V„ = max{A'1,X2, ...,Xn} forn> 1. We say A is an upper record value of this sequence if i > ,7 > 1. Thus A will be called an upper record value if its value exceeds that of all previous observations. The first record i = is called the trivial record.
The times at which records appear are of interest and are called record times. The random variables U(0) = 1, and U(m) = min{/: j > - 1), Aj > Xtf(m-1)} are called the upper record times, and the sequence { y(m),m > 0} is called the sequence of upper record times.
The sequence of inter-record times, denoted by { Tn, n > 1}, is defined as Tn = U(n + 1) — U(n),n = 1,2,...
Many distributional properties of upper record values in the sequence of iid continuous random variables ... , X^» with cdf F(x) and pdf f(x) have been expressed in terms of the
function R(x) = —ln[1 - F(x)]. The pdf of the upper record value ArU(m),(see Arnold, et al. [2]) is
(R(x))m-1
/»(x) =^l)T/(x), —ra<x<~ (2.1)
= xm is (2.2)
(2.3)
and the joint pdf of the first (m) upper record values = x1,Xu(2) = x2,. given by
m-1 , „
/(xj)
/i,2,...,m(x1, x2, , xm) = /(xm) j j ^
¿ = 1
w
and the joint pdf of the upper record values and (n < m) is
/n,»(x, y) = (n — ff^—n — 1) • 1—^x) • [R(y)—R(x)]m-n-1/(y),
to<x< v < to, n = 0,1, — , n<m,
the conditional probability density function of the upper record values given can be expressed as follows
/(x,|xj) = 0 — ¿ — 1) 1 —f^;^ (2.4)
— TO < < xj < TO
where fl(.) = — ln(1 — F(.))
Mahmoud Ali Selim RT&A, No 1 (77) THE ROLE OF RECORD VALUES IN STATISTICAL INFERENCE_Volume 19, March 2024
2.2 Lower Record Values
Let X1,X2, ...Xn,... be a sequence of iid random variables from a continuous distribution with cdf F(x) and pdf f(x). Let Yn = min[X1,X2, ...,Xn} for n > 1. We say Xj is a lower record value of this sequence if Yj < Yj-1 ,j > 1. Thus Xj will be called a lower record value if its value is lower than of all previous observations. By definition X1 is a lower record value. The times at which record appear are of interest which called a record times.
The random variables L(0) = 1, and L(m) = min{j. j > L(m — 1), Xj < XL(m-1-J are called the lower record times, and the sequence {L(m-,m > 0} is called the sequence of lower record times.
Many distributional properties of lower record values in the sequence of iid continuous random variables X1,X2,... , XL(m- with cdf F(x) and pdf f(x) have been expressed in terms of the function G(x) = — lnF(x). The pdf of the lower record value XL(mis
>m-1
fm(x)=(fa=Wf(x), -œ<x<œ (2.5)
and the joint pdf of the first (m) lower record values XL(1) = x1,XL(2) = x2,..., XL(m) = xm is given by
m-1 f( 1
fl,2.....m(xi,x2, ...,xm) = f(x(m)) I . (°v (2.6)
j=1F(x(o)
-m <xm < xm-i < ■■■ < xi < m and the joint pdf of the lower record values XL(S) and XL(r) (r < s) is
[ G(x)]r-1 f(x)
fr^ = {r - ms- r-iy/W)[G(y) -G(x)]s-r-1f(y) (27)
—i
-x> < y < x < X> where x = XL(r- andy = XL(S).
and the conditional pdf of the lower record values XL(j- given XLcan be expressed as follow
(G(xi) — G(xi)]J 1 1 f(x.-Q)
n^x)=v (j—t—iy. FxJy —-<xJ<x^<- (28)
where G(.) = — lnF(.) for more details, see for example, Ahsanullah and Nevzorov [3]. 2.3 The K-th Upper Record Values
Let {Xn,n > 1} be a sequence of iid random variables with a cdf F(x) and pdf f(x). The j-th order statistic of the sample X1,X2,... ,Xn is denoted by Xj.n. For a fixed positive integer k, Dziubdziela and Kopodnski [4] defined the sequence { U^\n > 1} of k-th upper record times for the sequence {Xn,n > 1} as follows: U[k) = 1
Un+1 = min{j > Un).Xj.j + k-1 > XU(V:U(nk) + k-1},
Then the sequence {y^),n > 1}, where y'^ = X^k) ,u(k)+k_ 1 is called a sequence of k-th upper
record values of {Xn, n > 1}. For convenience, we also take Y® = 0. Note that for k = 1, we get the
usual upper record values as defined in Chandler [1].
The pdf of y'^ (n > 1) as given by Grudzien [5] is k,n
fY(k)(x) =^:^[-lnF(x)]n-1[F(x)]k-1f(x), —™<x<™ (2.9)
and the joint pdf of Ymk) and Y,1 < m < n, n > 2, is
Mahmoud Ali Selim RT&A, No 1 (77)
THE ROLE OF RECORD VALUES IN STATISTICAL INFERENCE_Volume I9, March 2024
fc"
/„(*)„№)(*, y) = 7-TT77-TT7[-/ni?(y) + ZnF(x)]™1
ym (m - 1)! (n - m - 1)!
f(x)
x[-ZnF(x)]m-1^[F(y)]fc-1/(y), x < y (2.10)
Fix)
2.4 The K-th Lower Record Values
Let {Xn, n > 1} be a sequence of iid random variables with a cdf F(x) and pdf /(x). The j-th order statistic of a sample ...,Xn is denoted by XJ:n. For a fixed positive integer k, we defined
the sequence {Lfc(n), n > 1} as k-th lower record times of {Xn, n > 1} as follows: ifc(n) = 1
Lfc(n + 1) = min{; > ¿fc(n):Xfc:Lft(n)+fc-i > X^^-i), The sequence {l ),
n > 1}, where 7n(fc) = Xfc
:^(n)+fc-1 is called a sequence of k-th lower record
values of {Xn, n > 1}. For convenience, we also take Y^k) = 0. Note that for k = 1, we get the usual lower record values as defined in Chandler [1].
The pdf of Y(k)n (n > 1) as given by Grudzien [5] is
fcn
/y№)(x) = ^TYjy[-/nF(x)]n-1[F(x)]fc-1/(x), n > 1 (2.11)
and the joint pdf of l^-1 and 7n(fc),1 < m < n, n > 2, is
= (m-1)!(n-m-1)!HnF(y) + ^nF(x)]n-"-1
/(x)
x [- Zn F(x)]m-1 ^ [F(y)]fc-1/(y), y < x (2.12) 3. Literature Review of Statistical Inference of Record Values
This section presents a review of the statistical literature to emphasize the role of record values in statistical inference. Therefore, we will examine this issue from two perspectives, the first perspective being estimation and the second perspective being prediction. These are through some of the certain distributions are Exponential, Weibull, Gumbel, Geometric, Pareto, Generalized exponential, Rayleigh, Lomax, and Nadarajah-Haghighi distributions.
3.1 Review of Previous Studies on Estimation based on Record Values
In this subsection, previous studies on estimation problems based on record values for some of the certain distributions are reviewed.
3.1.1 Exponential distribution
Jaheen [6] obtained the ML and empirical Bayes estimate for the parameter of the exponential model based on record statistics. The estimate is obtained using the squared error loss and Varian's linear-exponential (LINEX) loss functions. Ahmadi and Doostparast [7] presented Bayes estimation when the data consist of k record values from a two-parameter exponential distribution under linear exponential loss function. Balakrishnan and Stepanov [8] discussed the Fisher information contained in records. In the case when the initial distribution belongs to the exponential family. Doostparast [9] derived the Bayesian and non-Bayesian estimates for the two parameters of the exponential distribution based on lower record values, with respect to the squared error (SE) and LINEX loss functions, and then compared with together. Arnold, et al. [2] obtained the ML estimates and best linear unbiased estimator (BLUE) for the exponential distribution. Wu [10] presented the interval estimation for the scale parameter of two-parameter exponential
distribution using upper record values. In addition, two methods for the joint confidence region of two parameters are proposed. Asgharzadeh, et al. [11] proposed two families of optimal confidence regions for the location and scale parameters of the two-parameter exponential distribution based on upper records. Constrained optimization problems are used to find the smallest-area confidence regions for the exponential parameters with a specified confidence level. Baklizi [12] considered the stress-strength reliability when the available data is in the form of record values from the one parameter and two parameters exponential distribution. The ML estimators and the associated confidence intervals are derived. Ahsanullah and Aliev [13] considered several distributional properties of the upper records from the exponential distribution and presented some characterizations of the exponential distribution.
3.1.2 Weibull distribution
Abd-El-Hakim and Sultan [14] obtained the maximum likelihood estimators for the location and scale parameters of Weibull distribution based on upper record values. Soliman, et al. [15] discussed a Bayesian analysis in the context of record statistics values from the two-parameter Weibull distribution. The ML and the Bayes estimates based on record values are derived for the two unknown parameters and some survival time parameters e.g., reliability and hazard functions. The Bayes estimates are obtained based on a conjugate prior for the scale parameter and a discrete prior for the shape parameter of this model. This is done with respect to both symmetric loss function (squared error loss), and asymmetric loss function (linear-exponential (LINEX)) loss function. Jafari and Zakerzadeh [16] proposed a simple and exact test and a confidence interval for the shape parameter. In addition to a generalized confidence interval, a generalized test variable is derived for the scale parameter when the shape parameter is unknown. The paper presents a simple and exact joint confidence region as well. Wang and Ye [17] investigated point estimation and confidence intervals estimation for the Weibull distribution based on record data. The uniformly minimum variance unbiased estimator for the Weibull shape is derived. Based on this estimator, a bias-corrected estimator for the Weibull scale is obtained and it is shown to have much smaller bias and mean squared error compared with the maximum likelihood estimator. Confidence intervals for parameters and reliability characteristics of interest are constructed using pivotal or generalized pivotal quantities. Then the results are extended to the stress-strength model involving two Weibull populations with different parameter values. Construction of confidence intervals for the stress-strength reliability is discussed. Raqab, et al. [18] considered the problem of the estimation for the 3-parameter Weibull distribution based on record data. The maximum likelihood method is used for the estimation of all parameters involved in the model. Hassan, et al. [19] investigated the estimation of multicomponent stress-strength reliability following Weibull distribution based on upper record values. Al-Duais [20] developed a LINEX loss function to estimate the parameters and reliability function of the Weibull distribution based on upper record values when both shape and scale parameters are unknown. They performed this by merging a weight into LINEX to produce a new loss function called the weighted linear exponential (WLINEX) loss function. Then, they utilized WLINEX to derive the parameters and reliability function of the Weibull distribution. The results revealed that the proposed method is the best for estimating parameters and has good performance for estimating reliability.
3.1.3 Gumbel distribution
Ahsanullah [21] obtained ML, best linear invariant and minimum variance unbiased (MVU) estimators of the Gumbel location and scale parameters. Mousa, et al. [22] obtained the Bayesian estimators for the two parameters of the Gumbel distribution based on lower record values. Malinowska and Szynal [23] obtained Bayesian estimation for the two parameters of a Gumbel distribution based on k-th lower record values. Seo and Kim [24] addressed inference problems for
Gumbel distribution when the available data are lower record values. they first derive unbiased estimators of unknown parameters, and then, they construct an exact confidence interval for the scale parameter by deriving certain properties and pivotal quantities. For Bayesian inference, they derive noninformative priors such as the Jeffreys and reference priors for unknown parameters and examine whether they satisfy the probability matching criteria; then, they apply them to develop objective Bayesian analysis. Asgharzadeh, et al. [25] presented exact confidence intervals and joint confidence regions for the parameters of Gumbel distribution based on record data. Exact confidence intervals and joint confidence regions for the parameters of inverse Weibull distribution are also discussed. Three numerical examples with climate data are presented to illustrate the proposed methods.
3.1.4 Geometric distribution
Ahsanullah and Holland [26] discussed some distributional properties of the record values of non-identically distributed random variables having geometric distributions. Three theorems dealing with the characterization of the geometric distribution based on these distributional properties are presented. The unique minimum variance unbiased estimators of some functions of the parameters of the distribution are studied. Ahmadi and Doostparast [7] obtained Bayesian and non-Bayesian estimators of the parameter of geometric distribution based on upper record values. Okasha and Wang [27] E-Bayesian and Bayesian methods have been used for estimating the parameter, reliability, and hazard functions of the geometric distribution based on upper record value samples. Francis, et al. [28] obtained the shrinkage estimate of R = P(X <Y) when X the stress and Y the strength are independent geometric variable and the sample on Y the strength is upper records.
3.1.5 Pareto distribution
Arnold and Press [29] discussed the Bayesian estimation for Pareto data based on record values. El-Qasem [30] used the upper record values to obtain the ML estimator for the uniform, the exponential and the Pareto distribution with one parameter. Sultan and Moshref [31] obtained the best linear unbiased estimates for the location and scale parameters of record values from the generalized Pareto distribution. Raqab, et al. [32], Raqab [33] obtained the ML and Bayes estimators from the two-parameter Pareto distribution for the two unknown parameters based on record values. Doostparast, et al. [34] on the basis of record values from the two-parameter Pareto distribution, ML and Bayes estimators as well as credible regions are developed for the two parameters of the Pareto distribution. Ahsanullah and Shakil [35] established some new results on the characterizations of the Pareto distribution by upper record values. Azhad, et al. [36] discussed inferences about the multicomponent stress strength reliability are drawn under the assumption that strength and stress follow independent Pareto distribution under the setup of upper record values. The ML estimator, Bayes estimator under squared error and LINEX loss functions, of multicomponent stress-strength reliability are constructed.
3.1.6 Generalized exponential distribution
Jaheen [6] derived Bayes and empirical Bayes estimators for the one-parameter of the generalized exponential distribution based on lower record values. These estimates are obtained based on squared error and LINEX loss functions. Madi and Raqab [37] used the importance sampling to estimate the model parameters. Baklizi [38] considered the ML and Bayesian estimation of the stress-strength reliability based on lower record values from the generalized exponential distribution. Confidence intervals, exact and approximate, as well as the Bayesian credible sets for the stress-strength reliability are obtained. Dey, et al. [39] derived the ML estimates and the Bayes estimates based on lower records for the unknown parameters of the
generalized exponential distribution. The Bayesian estimation of the parameters of the generalized exponential distribution has been studied with respect to both symmetric and asymmetric loss functions. They have also derived the Bayes interval. Sana and Faizan [40] obtained ML estimators for the two unknown parameters of the generalized exponential distribution based on lower record values. They also obtained the Bayes estimators of the unknown parameters using Lindley's approximation under symmetric and asymmetric loss functions.
3.1.7 Rayleigh distribution
Balakrishnan and Chan [41] derived explicit expressions for the means, variances and covariances from a Rayleigh distribution. They also established some recurrence relationships for the single and product moments. These results are then used to derive explicitly the best linear unbiased estimators for the scale-parameter as well as the location-scale parameter cases. Hendi, et al. [42] obtained the Bayes estimators for the parameter, reliability function, and failure rate function based on upper record values of Rayleigh distribution. These estimators are obtained on the basis of square error and LINEX loss functions. Soliman and Al-Aboud [43] obtained the estimators of the parameter of Rayleigh distribution based on upper record values, Bayesian and non-Bayesian approaches have been used to obtain the estimators of the parameter, and some lifetime parameters such as the reliability and hazard functions. Ahsanullah and Shakil [44] established some results on characterizations of Rayleigh distribution based on order statistics and record values. Seo, et al. [45] provided the exact confidence intervals for unknown by providing some pivotal quantities in the two-parameter Rayleigh distribution based on the upper record values. Finally, the validity of the proposed inference methods was examined from Monte Carlo simulations and real data. Seo and Kim [46] provided an objective Bayesian analysis method based on the objective priors (the Jeffreys and reference priors, and the second-order PMP) for unknown parameters of the two-parameter Rayleigh distribution when the upper record values are observed. Abdi and Asgharzadeh [47] presented exact joint confidence regions for the parameters of the Rayleigh distribution based on record data. By providing some appropriate pivotal quantities, they construct several joint confidence regions for the Rayleigh parameters. These joint confidence regions are useful for constructing confidence regions for functions of the unknown parameters.
3.1.8 Lomax distribution
Lee and Lim [48] characterized the Lomax distribution by conditional expectations of record values. Nasiri and Hosseini [49] obtained ML estimation based on records and a proper prior distribution to attain a Bayes estimation (both informative and non-informative) based on records for quadratic loss and squared error loss functions. The study considers the shortest confidence interval and highest posterior distribution confidence interval based on records. Mahmoud, et al. [50] considered the Bayes estimators of the unknown parameters of the Lomax distribution under the assumptions of gamma priors on both the shape and scale parameters. The Bayes estimators cannot be obtained in explicit forms. So, they propose Markov Chain Monte Carlo (MCMC) techniques to generate samples from the posterior distributions and in turn computing the Bayes estimators. Point estimation and confidence intervals based on ML and bootstrap methods are also used. Mahmoud, et al. [51] addressed the problem of estimating R = P[l < X] for the Lomax distributions, and classical and MCMC Bayesian analysis for R were developed when both samples on X and Y are in the form of upper record values, observed from the Lomax distribution. Hassan and Zaky [52] considered estimation of entropy for Lomax distribution based on upper record values. Bayesian estimator of Shannon entropy is discussed under informative and noninformative priors. The entropy Bayesian estimator and the corresponding credible interval based on a LINEX, squared error loss functions are derived.
3.1.9 Nadarajah-Haghighi distribution
Selim [53] discussed maximum likelihood and Bayes estimation of the two unknown parameters of Nadarajah and Haghighi distribution based on record values. It assumed that in Bayes case, the unknown parameters of Nadarajah and Haghighi distribution have gamma prior densities. Lindley approximation is exploited to obtain point estimators for the unknown parameters. Sana and Faizan [54] discussed maximum likelihood and Bayes estimation of the two unknown parameters of Nadarajah and Haghighi distribution based on record values. Different Bayes estimates are derived under squared error, balanced squared error and general entropy loss functions by using Jeffreys' prior information and extension of Jeffreys' prior information. Tierney and Kadane approximation method used to compute these estimates. MirMostafaee, et al. [55] obtained exact explicit expressions as well as several recurrence relations for the single and product moments of record values and then these results are used to compute the means, variances and the covariances of the upper record values. Also, these calculated moments are used to find the best linear unbiased estimators of the location and scale parameters of NH distribution. Confidence intervals for the unknown parameters are also discussed.
3.1.10 General classes of distributions
Abu-Youssef [56] characterized general classes of continuous distribution by considering the conditional expectation of function of record values. The specific distribution considered as a particular case of the general class of distribution are Weibull, Pareto, power function, Burr, beta of the first kind, Cauchy, rectangular, Rayleigh, Lomax, and inverse Weibull distributions. Ahmadi and Doostparast [7] obtained Bayesian estimation for the two parameters of some life distributions, including Exponential, Weibull, Pareto and Burr type XII, based on upper record values. Ahmadi, et al. [57] showed how to develop Bayes estimation in the context of upper k-record data from a semi-parametric class of distributions that includes several well-known lifetime distributions such as exponential, Weibull (one parameter), Pareto and Burr type XII under some balanced type of loss functions. Malinowska and Szynal [58] characterized general classes of continuous distributions by the conditional expectation of the kth lower record values. Specific distributions inverse exponential, inverse Weibull, inverse Pareto, negative exponential, negative Weibull, negative Pareto, negative power, Gumbel, exponentiated-Weibull, loglogistic, Burr X, inverse Burr XII and inverse paralogistic distributions.
3.2 Reviewing Previous Studies on Prediction based on Recorded Values
In this section, to accentuate the role of recorded values in statistical prediction, we will review the literature on prediction problems based on record values for some of the certain distributions.
3.2.1 Exponential distribution
Ahsanullah [59] obtained best linear unbiased predictor and best linear invariance predictor of future records Xs based on X^ 1 < i < m, for m < s, using the standard least squares theory. Dunsmore [60] studied the problem of predicting future records from the Bayesian viewpoint and derived classical results for the exponential and the gamma models. Awad and Raqab [61] considered the prediction problem of the future nth record value based on the first m (m<n) observed record values from one parameter exponential distribution. Jaheen [6] obtained empirical Bayes prediction bounds for future record values. Ahmadi and Doostparast [7] presented Bayes prediction procedures when the data consist of k record values from a two-parameter exponential distribution under linear exponential loss function. Ahmadi and MirMostafaee [62] studied the
Mahmoud Ali Selim RT&A, No 1 (77)
THE ROLE OF RECORD VALUES IN STATISTICAL INFERENCE_Volume 19, March 2024
problem of predicting future records based on observed order statistics from two-parameter exponential distribution. The prediction intervals for the future order statistics as well as for the total lifetime in a future sample of size m from two parameter exponential distribution are obtained on the basis of the first n records coming from the same distribution. Asgharzadeh, et al. [11] proposed two families of optimal confidence regions for the location and scale parameters of the two-parameter exponential distribution based on upper records.
3.2.2 Weibull distribution
Soliman, et al. [15] derived Bayesian predictive density function for Weibull distribution, which is necessary to obtain bounds for predictive interval of future record. Paul and Thomas [63] studied prediction of a future record for Weibull distribution using best linear unbiased predictor. Raqab, et al. [18] considered the problem of prediction for the 3-parameter Weibull distribution based on record data. The ML method is used for the joint prediction of future records along with the estimation of all parameters involved in the model. The existence and uniqueness of the MLPs of future records as well as the PMLEs of all unknown quantities were discussed in detail. Volovskiy and Kamps [64] considered point prediction of future record values from a sequence of independent and identically distributed two-parameter Weibull random variables using the maximum likelihood method. Two likelihood functions for prediction, the predictive and the observed predictive likelihood functions, are considered and the associated predictors are derived. Mean squared error and Pitman closeness criterion are used for comparing the prediction procedures.
3.2.3 Gumbel distribution
Ahsanullah [65] gave two types of predictors of the n-th record value based on the first m (m<n) record values. Mousa, et al. [22] obtained the Bayesian predictions, either point or interval, for future lower record values. Malinowska and Szynal [23] obtained Bayesian prediction, either point or interval, for future n-th lower record values. Seo and Kim [24] addressed inference problems for Gumbel distribution when the available data are lower record values. They first derive unbiased estimators of unknown parameters, and then, they construct a predictive interval for the next lower value by deriving certain properties and pivotal quantities.
3.2.4 Geometric distribution
Ahsanullah and Holland [26] discussed some distributional properties of the record values of non-identically distributed random variables having geometric distributions. Three theorems dealing with the characterization of the geometric distribution based on these distributional properties are presented. Also various predictors of the nth record valued utilizing the first m(m<n) record values are studied. Ahmadi and Doostparast [7] considered Bayesian and non-Bayesian prediction, either point or interval, of geometric distribution based on the past record values observed.
3.2.5 Pareto distribution
Arnold and Press [29] discussed the Bayesian prediction for Pareto data based on record values. Madi and Raqab [66] used the Bayesian approach to establish future predictions for the Pareto records. Raqab, et al. [32], Raqab [33] used the Bayesian approach to predicting future record values, either point or interval, from the Pareto distribution based on the past record values observed. Also, the ML prediction of the future records and other classical methods are used for obtaining prediction intervals for the future records. Paul and Thomas [67] studied prediction of future records of Pareto distribution by using best linear unbiased predictors. Shafay, et al. [68] discussed the problem of prediction of the two-parameter Pareto distribution from a future
sample. The Bayesian approach is applied to construct predictors based on observed k-record values for the cases when the future sample size is fixed and when it is random. Several Bayesian prediction intervals are derived.
3.2.6 Generalized exponential distribution
Jaheen [6] obtained the prediction bounds for future lower record values from the generalized exponential distribution by using Bayes and empirical Bayes techniques. Madi and Raqab [37] described and used a Bayesian parametric approach to predict the behavior of further Los Angeles rainfall records from generalized exponential distribution. Importance sampling is used to estimate the model parameters, and the Gibbs and Metropolis samplers are used to implement the prediction procedure. Dey, et al. [39] derived the Bayes interval and discussed the Bayesian prediction intervals of the future record values based on the observed record values. Vidovic [69] investigated Bayesian point predictors of order statistics from a future sample based on the k-th lower record values from generalized exponential distribution. Sana and Faizan [40] derived the Bayesian prediction for the future record values from generalized exponential distribution.
3.2.7 Rayleigh distribution
Balakrishnan and Chan [41] developed the prediction of a future record value and the test for superiority of the current record values from a Rayleigh distribution. Soliman and Al-Aboud [43] obtained Bayesian prediction intervals of the future record values from Rayleigh distribution. Seo, et al. [45] provided the exact predictive intervals for the future upper record values by providing some pivotal quantities in the two-parameter Rayleigh distribution based on the upper record values. Finally, the validity of the proposed inference methods was examined from Monte Carlo simulations and real data. Seo and Kim [46] provided an objective Bayesian analysis method based on the objective priors (the Jeffreys and reference priors, and the second-order PMP) for unknown parameters of the two-parameter Rayleigh distribution when the upper record values are observed. Abdi and Asgharzadeh [47] presented exact joint confidence regions for the parameters of the Rayleigh distribution based on record data. By providing some appropriate pivotal quantities, they construct several joint confidence regions for the Rayleigh parameters. These joint confidence regions are useful for constructing confidence regions for functions of the unknown parameters.
3.2.8 Lomax distribution
Volovskiy and Kamps [70] Point prediction of future record values from a sequence of independent and identically distributed Pareto and Lomax random variables is addressed. The focus is on likelihood-based prediction techniques; in particular, the maximum likelihood as well as the maximum observed likelihood prediction principles are invoked to derive predictors. Moreover, one-sided prediction intervals are also addressed.
3.2.9 Nadarajah-Haghighi distribution
MirMostafaee, et al. [55] investigated based on the observed records, how to obtain best linear unbiased predictor for the future record values. prediction intervals for future records are also discussed. Selim [53] discussed the Bayesian and non-Bayesian predictions of both point and interval predictions of the future record values.
3.2.10 General classes of distributions
AL-Hussaini and Ahmad [71] obtained Bayesian prediction bounds for the nth future record value based on the one-sample scheme, all of the informative and future observations are assumed
to be obtained from a general class of distributions which includes the Weibull, compound Weibull, Pareto, beta, Gompertz, compound Gompertz among other distributions. Ahmadi and Doostparast [7] obtained prediction, either point or interval, for future upper record values from a Bayesian view point of some life distributions, including Exponential, Weibull, Pareto and Burr type XII. Ahmadi, et al. [72] discussed the problem of predicting future k-records based on k-record data for a large class of distributions, which includes several well-known distributions such as: exponential, Weibull (one parameter), Pareto, Burr type XII, among others.
4. The Record Values Data in Applications
This section reviews the applications of records in various disciplines. The purpose of this review is to show the widespread use of real records data in statistical inference.
4.1 Applications on weather, rainfall, and floods
Raqab and Balakrishnan [73] considered the record values of daily temperatures (in degrees Fahrenheit) recorded at the National Center of Atmospheric Research (NCAR) during the year 2005. Nadar, et al. [74] considered the data set represents the monthly water capacity data from the Shasta reservoir in California, USA and were taken for the month of February from 1991 to 2010. Chacko and Mary [75] used the data which represent the records of the total annual rainfall (in inches) at Oxford, Eng-land, for the years 1858-1903. Seo and Song [76] analyzed two real data sets: one is the average annual temperatures (in degrees centigrade) recorded at Daejeon in Korea from 1969 to 2016). The other is carbon dioxide (CO2) emissions in Trinidad and Tobago from 1971 to 2016. Volovskiy and Kamps [77] considered data collected by the German Federal Office of Hydrology in its role as a scientific advisor to the Federal Waterways and Shipping Administration. The data set contains hourly measurements (in cm) of water level for the time period from January 1918 to February 2019 collected at the measurement site Cuxhaven-Steubenhoft located at the river Elbe. Selim [53] considered the real data set which represent the total annual rainfall (in inches) during the month of January from 1880 to 1916 recorded at Los Angeles Civic Center. Asgharzadeh, et al. [78] analyzed the total annual rainfall (in inches) during March recorded at Los Angeles Civic Center from 1973 to 2006 (see the website of Los Angeles Almanac: www.laalman-ac.com/weather/we08aa.htm). Raqab, et al. [79] discussed the analysis of real life data representing the water level exceedances over the level 65m by the River Nidd at Hunsingore Weir which is located in North Yorkshire, England from 1934 to 1970. Tripathi, et al. [80], Awwad, et al. [81] considered a real data set regarding the March precipitation measured in inches, over a period of 30 years which was reported by Hinkley [82].
4.2 Applications in industry and life-testing
Salehi and Doostparast [83] considered a data set on life testing of an given electrical equipment, planned for quality control purposes. Singh, et al. [84] considered the data set represents the failure times (in h) of 59 conductors from an accelerated life test from Lawless [85]. Vidovic [86] considered the case where failure and running times (1000 of cycles) of a sample of 30 units of a larger electrical system are under study. Wang, et al. [87] considered the real-life data set from Lawless [25, p. 3] which represents the times to breakdown of an electrical insulating fluid subjected to 30 kilovolts. Wu [10] considered the data for times between successive failures of air conditioning equipment in a Boeing 70 airplane.
4.3 Applications in health and medicine
Salehi and Doostparast [83] considered a data set representing the Hemoglobin of the Australian 102 men athletes data. Seo and Kim [88], Awwad, et al. [81] discussed the analysis of
the data of the survival times in days of a group of lung cancer patients provided in Lawless. Kumar, et al. [89] reanalyzing Efron's data pertaining to a head-and-neck cancer clinical trial. EL-Sagheer, et al. [90], Fayyazishishavan and Kilig Depren [91] used the data represents a COVID-19 data belonging to the Netherlands of 30 days, which recorded from 31 March to 30 April 2020.
4.4 Miscellaneous applications
Carlin and Gelfand [92] considered the record-breaking Olympic high jumps since 1896, as presented in the World Almanac and Book of Facts 1989 by Hoffman [93]. Tam§ [94] considered the data includes of the monthly actual taxes revenue (in million Egyptian pounds) in Egypt from January 2006 to November 2010. Volovskiy and Kamps [70] applied the proposed prediction procedures to the well-known Danish reinsurance claims dataset to predict record fire losses. The data were collected at Copenhagen Reinsurance and consist of 2167 fire losses in millions of Danish Krone between 1980 and 1990.
5. Computer Software for Records
Software specialized in calculating record values is quite rare. In program R, for example, the built-in routines for computing the records are available in two packages:
5.1 Package "Records": This package includes Functions for producing lower k-record times; lower k-record values; upper k-record times; upper k-record values for given samples (See Appendix). https://CRAN.R-project.org/package=Records.
5.2 Package "RecordTest": This package includes statistical tools based on the probabilistic properties of the record occurrence in a sequence of independent and identically distributed continuous random variables. That is tools to prepare a time series as well as distribution-free trend and change-point tests and graphical tools to study the record occurrence. Details about the implemented tools can be found in Castillo-Mateo, et al. [95]. https://CRAN.R-project.org/package=RecordTest.
6. Future Work
Although much has been done with respect to record values theory, there is still scope for more work. Here, we discuss some open problems that the researchers may like to work on.
i. There is little work with respect to the theory of records for bivariate or multivariate random sequences. Therefore, we recommend more studies in this direction.
ii. Does the type of records affect the estimates of the parameters?. Tripathi, et al. [96] noted that the performance of the estimator depends on the type of records. However, the suitability of the type of record varies from one distribution to distribution. Therefore, this topic needs further study.
iii. Develop a methodology for conducting inference based on record values and record times. Where the record times and record values jointly contain considerably more information about distribution than do the record values alone, see Feuerverger and Hall [97].
iv. Instead of just using record values in inference, we suggest using the record values with their corresponding inter-record times, see Kizilaslan and Nadar [98], Arashi and Emadi [99].
v. Investigating the concept of records with respect to using the kernel density approach to characterize the behavior of records is an interesting extension of the theory of records. This approach will be very useful in cases where a classical distribution
cannot be identified to statistically fit the underlying data from which the record observations are obtained. vi. Serious difficulties arise for statistical inference based on records due to the fact that the occurrences of record data are very rare in practical situations [since the mean of the number of records in a random sample of size n is equal to 1 + 2-1 + —+ n-1 (see Arnold, et al. [100])] and the expected waiting time is infinite for every record after the first. Although, these problems can be avoided if we consider the model of k-record statistics introduced by Dziubdziela and Kopodnski [4]. However, research is still open to investigating this problem.
7. Conclusion
By reviewing the literature (we only mentioned some of them) on the record values in statistical inference, we conclude that the records have played a significant and widely utilized role for statistical inference in parameter estimation, predicting future values, hypothesis tests, as well as stress-strength tests and characterizing of distributions. For this purpose, various known statistical inference methods have been used, including Bayesian and non-Bayesian methods. We also conclude from the applications in previous studies that the records are not limited to a specific field, but rather comprehend all aspects of life including sports, health, medicine, insurance, economy, industry, climate, environment, floods, and rainfall. Overall, records play a critical role in statistical inference by providing the data needed to make informed decisions and draw accurate conclusions.
References
[1] K. Chandler, "The distribution and frequency of record values," Journal of the Royal Statistical Society: Series B (Methodological), vol. 14, no. 2, pp. 220-228, 1952.
[2] B. C. Arnold, N. Balakrishnan, and H. N. Nagaraja, Records. John Wiley & Sons, 2011.
[3] M. Ahsanullah and V. B. Nevzorov, "Records via probability theory," 2015.
[4] W. Dziubdziela and B. Kopocinski, "Limiting properties of the k-th record values," Applicationes Mathematicae, vol. 2, no. 15, pp. 187-190, 1976.
[5] Z. Grudzien, "Characterization of distribution of time limits in record statistics as well as distributions and moments of linear record statistics from the samples of random numbers," Praca Doktorska, UMCS, Lublin, 1982.
[6] Z. F. Jaheen, "Empirical Bayes inference for generalized exponential distribution based on records," Communications in Statistics-Theory and Methods, vol. 33, no. 8, pp. 1851-1861, 2004.
[7] J. Ahmadi and M. Doostparast, "Bayesian estimation and prediction for some life distributions based on record values," Statistical Papers, vol. 47, pp. 373-392, 2006.
[8] N. Balakrishnan and A. Stepanov, "On the Fisher information in record data," Statistics & probability letters, vol. 76, no. 5, pp. 537-545, 2006.
[9] M. Doostparast, "A note on estimation based on record data," Metrika, vol. 69, no. 1, pp. 69-80, 2009.
[10] S.-F. Wu, "Interval Estimation for the Two-Parameter Exponential Distribution Based on the Upper Record Values," Symmetry, vol. 14, no. 9, p. 1906, 2022.
[11] A. Asgharzadeh, S. Bagheri, N. Ibrahim, and M. Abubakar, "Optimal confidence regions for the two-parameter exponential distribution based on records," Computational Statistics, vol. 35, pp. 309-326, 2020.
[12] A. Baklizi, "Estimation of Pr (X< Y) using record values in the one and two parameter exponential distributions," Communications in Statistics—Theory and Methods, vol. 37, no. 5, pp.692-698, 2008.
[13] M. Ahsanullah and F. Aliev, "Some characterizations of exponential distribution by record values," Journal of Statistical Research, vol. 42, no. 2, pp. 41-46, 2008.
[14] N. Abd-El-Hakim and K. Sultan, "Maximum likelihood estimates of Weibull parameters based on record values," J. Egypt. Math. Soc, vol. 9, no. 1, pp. 79-89, 2001.
[15] A. A. Soliman, A. H. Abd Ellah, and K. S. Sultan, "Comparison of estimates using record statistics from Weibull model: Bayesian and non-Bayesian approaches," Computational Statistics & Data Analysis, vol. 51, no. 3, pp. 2065-2077, 2006.
[16] A. A. Jafari and H. Zakerzadeh, "Inference on the parameters of the Weibull distribution using records," arXivpreprint arXiv:1501.02201, 2015.
[17] B. X. Wang and Z.-S. Ye, "Inference on the Weibull distribution based on record values," Computational Statistics & Data Analysis, vol. 83, pp. 26-36, 2015.
[18] M. Z. Raqab, L. A. Alkhalfan, O. M. Bdair, and N. Balakrishnan, "Maximum likelihood prediction of records from 3-parameter Weibull distribution and some approximations," Journal of computational and applied mathematics, vol. 356, pp. 118-132, 2019.
[19] A. S. Hassan, H. F. Nagy, H. Z. Muhammed, and M. S. Saad, "Estimation of multicomponent stress-strength reliability following Weibull distribution based on upper record values," Journal of Taibah University for Science, vol. 14, no. 1, pp. 244-253, 2020.
[20] F. S. Al-Duais, "Bayesian estimations under the weighted LINEX loss function based on upper record values," Complexity, vol. 2021, pp. 1-7, 2021.
[21] M. Ahsanullah, "Estimation of the parameters of the Gumbel distribution based on the m record values," Comput. Statist. Quart, vol. 6, pp. 231-239, 1990.
[22] M. A. Mousa, Z. Jaheen, and A. Ahmad, "Bayesian estimation, prediction and characterization for the Gumbel model based on records," Statistics: A Journal of Theoretical and Applied Statistics, vol. 36, no. 1, pp. 65-74, 2002.
[23] I. Malinowska and D. Szynal, "On a family of Bayesian estimators and predictors for a Gumbel model based on the kth lower records," Applicationes Mathematicae, vol. 1, no. 31, pp.107-115, 2004.
[24] J. I. Seo and Y. Kim, "Statistical inference on Gumbel distribution using record values," Journal of the Korean Statistical Society, vol. 45, no. 3, pp. 342-357, 2016.
[25] A. Asgharzadeh, M. Abdi, and S. Nadarajah, "Interval estimation for Gumbel distribution using climate records," Bulletin of the Malaysian Mathematical Sciences Society, vol. 39, pp. 257-270, 2016.
[26] M. Ahsanullah and B. Holland, "Distributional properties of record values from the geometric distribution," Statistica neerlandica, vol. 41, no. 2, pp. 129-137, 1987.
[27] H. M. Okasha and J. Wang, "E-Bayesian estimation for the geometric model based on record statistics," Applied Mathematical Modelling, vol. 40, no. 1, pp. 658-670, 2016.
[28] G. Francis, E. Anjana, and E. Jeevanand, "Shrinkage Estimation of Strength Reliability for Geometric Distribution Using Record Values," Acta Scientific COMPUTER SCIENCES Volume, vol. 4, no. 4, 2022.
[29] B. C. Arnold and S. J. Press, "Bayesian estimation and prediction for Pareto data," Journal of the American Statistical Association, vol. 84, no. 408, pp. 1079-1084, 1989.
[30] A. El-Qasem, "Estimation via record values," Journal of Information and Optimization Sciences, vol. 17, no. 3, pp. 541-548, 1996.
[31] K. S. Sultan and M. E. Moshref, "Record values from generalized Pareto distribution and associated inference," Metrika, vol. 51, no. 2, pp. 105-116, 2000.
[32] M. Z. Raqab, J. Ahmadi, and M. Doostparast, "Statistical inference based on record data from Pareto model," Statistics, vol. 41, no. 2, pp. 105-118, 2007.
[33] M. Z. Raqab, "Distribution-free prediction intervals for the future current record statistics," Statistical Papers, vol. 50, pp. 429-439, 2009.
[34] M. Doostparast, M. G. Akbari, and N. Balakrishna, "Bayesian analysis for the two-parameter Pareto distribution based on record values and times," Journal of Statistical Computation and Simulation, vol. 81, no. 11, pp. 1393-1403, 2011.
[35] M. Ahsanullah and M. Shakil, "A note on the characterizations of Pareto distribution by upper record values," Communications of the Korean Mathematical Society, vol. 27, no. 4, pp.835-842, 2012.
[36] Q. J. Azhad, M. Arshad, and N. Khandelwal, "Statistical inference of reliability in multicomponent stress strength model for pareto distribution based on upper record values," International Journal of Modelling and Simulation, vol. 42, no. 2, pp. 319-334, 2022.
[37] M. T. Madi and M. Z. Raqab, "Bayesian prediction of rainfall records using the generalized exponential distribution," Environmetrics: The official journal of the International Environmetrics Society, vol. 18, no. 5, pp. 541-549, 2007.
[38] A. Baklizi, "Likelihood and Bayesian estimation of Pr (X< Y) using lower record values from the generalized exponential distribution," Computational Statistics & Data Analysis, vol. 52, no. 7, pp. 3468-3473, 2008.
[39] S. Dey, T. Dey, M. Salehi, and J. Ahmadi, "Bayesian inference of generalized exponential distribution based on lower record values," American Journal of Mathematical and Management Sciences, vol. 32, no. 1, pp. 1-18, 2013.
[40] S. Sana and M. Faizan, "Bayesian estimation using lindley's approximation and prediction of generalized exponential distribution based on lower record values," Journal of Statistics Applications & Probability, vol. 10, no. 1, pp. 61-75, 2021.
[41] N. Balakrishnan and P. Chan, "Record values from Rayleigh and Weibull distributions and associated inference," NIST special publication SP, pp. 41-41, 1994.
[42] M. Hendi, S. Abu-Youssef, and A. Alraddadi, "A Bayesian analysis of record statistics from the Rayleigh model," in International Mathematical Forum, 2007, vol. 2, no. 13, pp. 619-631.
[43] A. A. Soliman and F. M. Al-Aboud, "Bayesian inference using record values from Rayleigh model with application," European Journal of Operational Research, vol. 185, no. 2, pp. 659-672, 2008.
[44] M. Ahsanullah and M. Shakil, "Characterizations of Rayleigh distribution based on order statistics and record values," Bull. Malays. Math. Sci. Soc, vol. 36, no. 3, pp. 625-635, 2013.
[45] J.-I. Seo, J.-W. Jeon, and S.-B. Kang, "Exact interval inference for the two-parameter Rayleigh distribution based on the upper record values," Journal of Probability and Statistics, vol. 2016, 2016.
[46] J. I. Seo and Y. Kim, "Objective Bayesian inference based on upper record values from Rayleigh distribution," Communications for Statistical Applications and Methods, vol. 25, no. 4, pp. 411-430, 2018.
[47] M. Abdi and A. Asgharzadeh, "Rayleigh confidence regions based on record data," Journal of Statistical Research of Iran JSRI, vol. 14, no. 2, pp. 171-188, 2018.
[48] M.-Y. Lee and E.-H. Lim, "Characterizations of the Lomax, exponential and Pareto distributions by conditional expectations of record values," Journal of the Chungcheong Mathematical Society, vol. 22, no. 2, pp. 149-149, 2009.
[49] P. Nasiri and S. Hosseini, "Statistical inferences for Lomax distribution based on record values (Bayesian and classical)," Journal of Modern Applied Statistical Methods, vol. 11, no. 1, p. 15, 2012.
[50] M. A. Mahmoud, A. A. Soliman, A. H. Abd Ellah, and R. M. El-sagheer, "MCMC technique to study the Bayesian estimation using record values from the Lomax distribution," International Journal of Computer Applications, vol. 73, no. 5, 2013.
[51] M. A. Mahmoud, R. M. El-Sagheer, A. A. Soliman, and A. H. Abd Ellah, "Bayesian estimation of P [Y< X] based on record values from the Lomax distribution and MCMC technique," Journal of Modern Applied Statistical Methods, vol. 15, no. 1, p. 25, 2016.
[52] A. S. Hassan and A. N. Zaky, "Entropy Bayesian estimation for Lomax distribution based on record," Thailand Statistician, vol. 19, no. 1, pp. 95-114, 2021.
[53] M. A. Selim, "Estimation and prediction for Nadarajah-Haghighi distribution based on record values," Pak. J. Statist, vol. 34, no. 1, pp. 77-90, 2018.
[54] M. Sana and M. Faizan, "Bayesian estimation for Nadarajah-Haghighi distribution based on upper record values," Pakistan Journal of Statistics and Operation Research, pp. 217230, 2019.
[55] S. T. MirMostafaee, A. Asgharzadeh, and A. Fallah, "Record values from NH distribution and associated inference," Metron, vol. 74, pp. 37-59, 2016.
[56] S. E. Abu-Youssef, "On characterization of certain distributions of record values," Applied mathematics and computation, vol. 145, no. 2-3, pp. 443-450, 2003.
[57] J. Ahmadi, M. J. Jozani, É. Marchand, and A. Parsian, "Bayes estimation based on k-record data from a general class of distributions under balanced type loss functions," Journal of Statistical Planning and Inference, vol. 139, no. 3, pp. 1180-1189, 2009.
[58] I. Malinowska and D. Szynal, "On characterization of certain distributions of kth lower (upper) record values," Applied Mathematics and Computation, vol. 202, no. 1, pp. 338347, 2008.
[59] M. Ahsanullah, "Linear prediction of record values for the two parameter exponential distribution," Annals of the Institute of Statistical Mathematics, vol. 32, pp. 363-368, 1980.
[60] I. R. Dunsmore, "The future occurrence of records," Annals of the Institute of Statistical Mathematics, vol. 35, pp. 267-277, 1983.
[61] A. M. Awad and M. Z. Raqab, "Prediction intervals for the future record values from exponential distribution: comparative study," Journal of Statistical Computation and Simulation, vol. 65, no. 1-4, pp. 325-340, 2000.
[62] J. Ahmadi and S. MirMostafaee, "Prediction intervals for future records and order statistics coming from two parameter exponential distribution," Statistics & Probability Letters, vol. 79, no. 7, pp. 977-983, 2009.
[63] J. Paul and P. Y. Thomas, "On generalized upper (k) record values from Weibull distribution," Statistica, vol. 75, no. 3, pp. 313-330, 2015.
[64] G. Volovskiy and U. Kamps, "Likelihood-Based Prediction of Future Weibull Record Values," REVSTAT-Statistical Journal, vol. 21, no. 3, pp. 425-445-425-445, 2023.
[65] M. Ahsanullah, "Inference and prediction of the Gumbel distribution based on record values," Pakistan Journal of Statistics, vol. 7, no. 3, pp. 53-62, 1991.
[66] M. T. Madi and M. Z. Raqab, "Bayesian prediction of temperature records using the Pareto model," Environmetrics, vol. 15, no. 7, pp. 701-710, 2004.
[67] J. Paul and P. Y. Thomas, "On generalized (k) record values from Pareto distribution," Aligarh J Statist, vol. 36, no. 1, pp. 63-78, 2016.
[68] A. R. Shafay, N. Balakrishnan, and J. Ahmadi, "Bayesian prediction of order statistics with fixed and random sample sizes based on k-record values from Pareto distribution," Communications in Statistics-Theory and Methods, vol. 46, no. 2, pp. 721-735, 2017.
[69] Z. Vidovic, "Bayesian Prediction of Order Statistics Based on k-Record Values from a Generalized Exponential Distribution," Stats, vol. 2, no. 4, pp. 447-456, 2019.
[70] G. Volovskiy and U. Kamps, "Comparison of likelihood-based predictors of future Pareto and Lomax record values in terms of Pitman closeness," Communications in Statistics-Theory and Methods, vol. 52, no. 6, pp. 1905-1922, 2023.
[71] E. K. AL-Hussaini and A. E.-B. A. Ahmad, "On Bayesian predictive distributions of generalized order statistics," Metrika, vol. 57, pp. 165-176, 2003.
[72] J. Ahmadi, M. Jafari Jozani, É. Marchand, and A. Parsian, "Prediction of k-records from a general class of distributions under balanced type loss functions," Metrika, vol. 70, no. 1, pp. 19-33, 2009.
[73] M. Z. Raqab and N. Balakrishnan, "Prediction intervals for future records," Statistics & Probability Letters, vol. 78, no. 13, pp. 1955-1963, 2008.
[74] M. Nadar, A. Papadopoulos, and F. Kizilaslan, "Statistical analysis for Kumaraswamy's distribution based on record data," Statistical Papers, vol. 54, pp. 355-369, 2013.
[75] M. Chacko and M. S. Mary, "Estimation and prediction based on k-record values from normal distribution," Statistica, vol. 73, no. 4, pp. 505-516, 2013.
[76] J.-I. Seo and J. J. Song, "A bayesian nonparametric model for upper record data," Applied Mathematical Modelling, vol. 71, pp. 363-374, 2019.
[77] G. Volovskiy and U. Kamps, "Maximum product of spacings prediction of future record values," Metrika, vol. 83, no. 7, pp. 853-868, 2020.
[78] A. Asgharzadeh, A. Fallah, M. Raqab, and R. Valiollahi, "Statistical inference based on Lindley record data," Statistical Papers, vol. 59, pp. 759-779, 2018.
[79] M. Z. Raqab, O. M. Bdair, and F. M. Al-Aboud, "Inference for the two-parameter bathtub-shaped distribution based on record data," Metrika, vol. 81, pp. 229-253, 2018.
[80] A. Tripathi, U. Singh, and S. K. Singh, "Inferences for the DUS-exponential distribution based on upper record values," Annals of Data Science, vol. 8, pp. 387-403, 2021.
[81] R. R. A. Awwad, O. M. Bdair, and G. K. Abufoudeh, "Bayesian estimation and prediction based on Rayleigh record data with applications," Statistics in Transition new series, vol. 22, no. 3, pp. 59-79, 2021.
[82] D. Hinkley, "On quick choice of power transformation," Journal of the Royal Statistical Society: Series C (Applied Statistics), vol. 26, no. 1, pp. 67-69, 1977.
[83] M. Salehi and M. Doostparast, "Expressions for the mean of the order statistics from the skew-normal distribution and their application," The Proceeding of Refereed and Invited Papers, p. 477, 2015.
[84] S. Singh, Y. Mani Tripathi, and S.-J. Wu, "Bayesian estimation and prediction based on lognormal record values," Journal of Applied Statistics, vol. 44, no. 5, pp. 916-940, 2017.
[85] J. F. Lawless, Statistical models and methods for lifetime data. John Wiley & Sons, 2011.
[86] Z. Vidovic, "On MLEs of the parameters of a modified Weibull distribution based on record values," Journal of Applied Statistics, vol. 46, no. 4, pp. 715-724, 2019.
[87] L. Wang, Y. M. Tripathi, S.-J. Wu, and M. Zhang, "Inference for confidence sets of the generalized inverted exponential distribution under k-record values," Journal of Computational and Applied Mathematics, vol. 380, p. 112969, 2020.
[88] J. I. Seo and Y. Kim, "Objective Bayesian analysis based on upper record values from two-parameter Rayleigh distribution with partial information," Journal of Applied Statistics, vol. 44, no. 12, pp. 2222-2237, 2017.
[89] D. Kumar, M. Kumar, and J. Saran, "Power Generalized Weibull Distribution Based on Record Values and Associated Inferences with Bladder Cancer Data Example," Communications in Mathematics and Statistics, pp. 1-26, 2022.
[90] R. M. EL-Sagheer, M. S. Eliwa, K. M. Alqahtani, and M. El-Morshedy, "Bayesian and non-Bayesian inferential approaches under lower-recorded data with application to model COVID-19 data," AIMS Mathematics, vol. 7, no. 9, pp. 15965-15981, 2022.
[91] E. Fayyazishishavan and S. Kiliç Depren, "Inference of stress-strength reliability for two-parameter of exponentiated Gumbel distribution based on lower record values," Plos one, vol. 16, no. 4, p. e0249028, 2021.
[92] B. P. Carlin and A. E. Gelfand, "Parametric likelihood inference for record breaking problems," Biometrika, vol. 80, no. 3, pp. 507-515, 1993.
[93] M. S. Hoffman, "The world almanac and book of facts 1989. New York: Newspaper Enterprise Association," ed: Inc, 1988.
[94] C. Tani§, "Transmuted lower record type inverse rayleigh distribution: estimation, characterizations and applications," Ricerche di Matematica, vol. 71, no. 2, pp. 777-802, 2022.
[95] J. Castillo-Mateo, A. C. Cebrián, and J. Asín, "RecordTest: An R Package to Analyze Non-Stationarity in the Extremes Based on Record-Breaking Events," Journal of Statistical Software, vol. 106, pp. 1-28, 2023.
[96] A. Tripathi, U. Singh, and S. K. Singh, "Does the Type of Records Affect the Estimates of the Parameters?," Journal of Modern Applied Statistical Methods, vol. 19, no. 1, p. 27, 2022.
[97] A. Feuerverger and P. Hall, "On statistical inference based on record values," Extremes, vol. 1, pp. 169-190, 1998.
[98] F. Kizilaslan and M. Nadar, "Estimation and prediction of the Kumaraswamy distribution based on record values and inter-record times," Journal of Statistical Computation and Simulation, vol. 86, no. 12, pp. 2471-2493, 2016.
[99] M. Arashi and M. Emadi, "Evidential inference based on record data and inter-record times," Statistical Papers, vol. 49, pp. 291-301, 2008.
[100] B. Arnold, N. Balakrishnan, and H. Nagaraja, "Records. John Wiley&Sons," New York, 1998.
Appendix: R program
rinfal<-c(11.30, 20.34, 13.13, 10.4, 12.11, 38.18, 9.21, 22.31, 14.05, 13.87, 19.28, 34.84, 13.36, 11.85, 26.28, 6.73, 16.11, 8.51, 16.86, 7.06, 5.59, 7.91, 16.29, 10.6, 19.32, 8.72, 19.52, 18.65, 19.3, 11.72, 19.18, 12.63, 16.18, 11.6, 13.42, 23.65, 17.05, 19.92, 15.26, 13.86, 8.58, 12.52, 13.71, 19.66, 9.59, 6.67, 7.38, 17.56, 17.44, 9.77, 12.66, 12.5, 12.53, 16.95, 11.84, 14.55, 21.66, 12.07, 22.41, 23.43, 13.06, 18.96, 32.76, 11.18, 19.17, 19.21, 11.58, 12.13, 12.61, 7.22, 7.99, 10.6, 8.21, 26.21, 9.46, 11.99, 11.94, 16, 9.54, 21.13, 5.58, 8.18, 4.85, 18.79, 8.38, 7.93, 13.69, 20.44, 22, 16.58, 27.47, 7.77, 12.32, 7.17, 21.26, 14.92, 14.35, 7.22, 12.31, 33.44, 19.67, 26.98, 8.98, 10.71, 31.25, 10.43, 12.82, 17.86, 7.66, 12.48, 8.08, 7.35, 11.47, 21, 27.36, 8.11, 24.35, 12.46, 12.4, 31.01, 9.09, 11.57, 17.94, 4.42, 16.49, 9.24, 37.25, 13.19, 3.21, 13.53, 9.08, 16.36, 20.2, 8.69, 5.85, 6.08, 8.52, 9.65, 19, 4.79, 18.82, 14.86, 5.82, 12.4, 27.85)
> library(Records)
> lower.record.times(rinfal, 1)
[1] 1 4 7 16 21 81 83 124 129
lower.record.values(rinfal, 1)
[1] 11.30 10.40 9.21 6.73 5.59 5.58 4.85 4.42 3.21
> upper.record.times(rinfal, 1)
[1] 1 2 6
> upper.record.values(rinfal, 1)
[1] 11.30 20.34 38.18
> lower.record.times(rinfal, 2)
[1] 2 3 4 7 16 18 20 21 46 81 83 124 129
> lower.record.values(rinfal, 2)
[1] 20.34 13.13 11.30 10.40 9.21 8.51 7.06 6.73 6.67 5.59 5.58 4.85 4.42
> upper.record.times(rinfal, 2)
[1] 2 3 6 8 12 127
> upper.record.values(rinfal, 2)
[1] 11.30 13.13 20.34 22.31 34.84 37.25