ESTIMATING POPULATION MEAN USING RATIO ESTIMATOR FOR MEAN IS LESS THAN VARIANCE IN SIMPLE RANDOM SAMPLING
1*NAEEM SHAHZAD, 2ABIDA, 3FREDERICK CHONG CHEN TSHUNG
1*PST, School Education Department Govt. Of the Punjab, Pakistan. Department of Statistics, National College of Business Administration & Economics Lahore,
(NCBA&E) Sub Campus Multan, Pakistan. 2PST, School Education Department Govt. Of the Punjab, Pakistan-School of Statistics, Minhaj University Lahore, Punjab, Pakistan. 3Open University Malaysia Faculty of Business and Management, Malaysia. 0rcid:0000-0001 -9476-9379 ^Corresponding Author Email: [email protected].
Abstract:
This paper presents a thorough investigation into the estimation of population mean using ratio estimators within the framework of simple random sampling, with a particular focus on scenarios where the population mean is less than the variance. The paper underscores the importance of including auxiliary variables and their specific measures in the construction of ratio estimators. Factors such as sample size, coefficient of variation, kurtosis, and correlation coefficients prove to be valuable in improving estimation accuracy across different sample sizes, particularly when the population mean is less than the variance. The proposed estimator is based on the ratio of the sample mean and the sample median of the variable of interest, and it Incorporates the information on the population size and the sample size. We derive the bias and the mean squared error of the proposed estimator, and compare its performance with some existing ratio estimators using simulated and real data sets. The results show that the proposed estimator has smaller bias and mean squared error than the conventional ratio estimator and some other ratio estimators under certain conditions. We also provide some recommendations for choosing an appropriate ratio estimator depending on the characteristics of the population.
Keywords: Ratio Estimator, Simple Random Sampling, variance is Less, Population Mean, Mean square Error, Relative efficiency
INTRODUCTION
Simple Random Sampling
Simple Random Sampling is a commonly employed statistical sampling method. In this technique, each element or individual within the population has an equal and independent probability of being chosen for inclusion in the sample. This approach serves as the basis for a range of estimation methods, one of which is the ratio estimator. Utilizing a ratio estimator for estimating the population mean in simple random sampling is a valuable statistical approach. It is often expected that the ratio estimator will yield an estimate that is less than the population variance, given the inherent properties of data dispersion, the influence of sample variation, and potential bias correction. Understanding this relationship is essential for accurate estimation and inference in statistics. Ratio Estimation
Estimating the population mean of an important variable with the use of contextual information is a common application of ratio estimation. The ratio estimator in simple random sampling is the ratio of the means of the sample for the dependent and independent variables. It is well-known that the ratio estimator is biased, and many strategies have been developed to address this. Most of these strategies, however, count on the fact that the variance of the characteristic of interest in the population is larger than or equal to the mean of the population, which is not always the case. For example, if the variable of interest is a count or a proportion, it is possible that the population mean
2049
is greater than the population variance. In such cases, the conventional ratio estimator and its corrections may not perform well and may lead to large errors. Regression Estimator
When the dependent and independent variables in a study are correlated (whether positively or negatively), the results can be analyzed using the Regression Estimator. An alternative to the ratio or product estimator is the regression estimator if the relationship does not go via the origin. Regression estimator is presented as
Myl= y
More Use of Auxiliary Variable
Auxiliary variable also used in the ratio and regression estimator in term of coefficient of skewness, coefficient of kurtosis and quantiles. Some details of these measures are presented below. Skewness
When a set of data doesn't follow the symmetrical bell curve, or normal distribution, we say that it has skewness. A skewed curve is one that has been tilted to the left or right. The degree to which a distribution deviates from normality is a measure of its skewness. For example, a lognormal distribution would be right-skewed while the skew of a normal distribution is zero. The degree of skewness can be evaluated in various ways. Common skewness measures include Pearson's scond and scond coefficients. Person mode skewness, or Pearson's first coefficient of skewness, is calculated by taking the difference between the mode and the mean and dividing it by the standard deviation. To calculate the Pearson median skewness, also known as the second coefficient of skewness by Person, we first subtract the median from the mean, then multiply that number by three, and finally divide the resulting number by the standard deviation. Pearson's skewness formulas are provided.
X-M0
Sk! =--
s
X-M„
Sk2 = 32 s
Kurtosis
In simple random sampling, if you find that the kurtosis (using a ratio estimator for the mean) is less than the variance, it suggests that the distribution may have lighter tails compared to a normal distribution. This means there are fewer extreme values in your sample data compared to what you would expect in a normal distribution. However, this observation doesn't necessarily indicate a problem; it characterizes the shape of the distribution within the context of your sample. It's crucial to choose appropriate sampling methods and estimators based on your data and research objectives. The kurtosis of a distribution is the degree to which its tails deviate from the normal distribution's tails. Simply said, kurtosis measures the presence or absence of outlying values in the tails of a distribution. Kurtosis is a statistical measure of dispersion used to assess financial risk. Quartile deviation
The quartile is a measure of the range of values from the top and bottom of the distribution. The quartile system creates four groupings of data by dividing it into three points (the lowest, middle, and highest values).
Half the gap between the top and bottom quartiles multiplied by itself is the quartile deviation. We can define mathematically as:
Quartile deviation =(Q3~Ql\
You can use the quartile deviation to get a sense of the range in where the middle 50% of your sample data falls, which is useful for analysing the dispersion of a distribution around a measure of its central tendency, such as the mean or average.
When a data collection is divided into quartiles, like the median divides it in half, we can learn about the distribution of the data. The quartiles are determined by adding together the scores on either side of the cutoff and dividing by two.
Twenty-five percent (25%) of the distribution is the first quartile, also known as the lowest quartile. The score points known as quartiles are used to divide a distribution into four equal groups.
2050
Relative efficiency
Efficiency is a metric used to evaluate the relative merits of different statistical methods for estimating, designing experiments, and evaluating hypotheses. One can attain the same level of performance with fewer observations when using a more efficient estimator, experiment, or test. The ratio of the MSEs of two estimators is a good measure of their relative accuracy. This is the same as the variance ratio if the two estimators are independent.
Comparing a particular technique to the notational "best possible" typically necessitates considering the relative efficiency of both procedures. It is possible to utilise an asymptotic relative efficiency, which is defined as the limit of the relative efficiencies as the sample size increases, but in practise, the efficiencies and relative efficiencies of two methods depend on the sample size available for the particular procedure. An efficient estimator has a small variance or MSE, meaning that the difference between the true value and the predicted value is negligible. If both estimators, E1 and E2, are unbiased, then the one with the smaller variance is often used. Effectiveness of E1 compared to E2
RE
Tailor and Sharma (2009) presented the ratio-cum-product approximation of the mean of a finite population by utilising the coefficient of variation and the information coefficient of kurtosis. The sample mean estimator, the typical ratio estimator, and the product estimator have all been found to be less effective than the proposed estimator. It was also shown that the proposed estimators have merit. The effectiveness of the estimator was enhanced by using supplementary data during both the estimating and selection phases. The positive association between the auxiliary varying and the study variate indicated the need for an estimating step as well as a selection stage to boost the effectiveness of estimators. The mean of the population was calculated using the ratio technique. Naeem Shahzad and Abida (2022) In a framework of simple random sampling, this paper explores the use of ratio estimators to determine the population mean of a study variable when the mean is larger than the variance. The authors evaluate and suggest 52 new ratio estimators using a wide range of methods and techniques. These estimators, together with their respective mean square errors, are presented in the publication. When the variation is larger than the mean, a simulation study is used to make the comparison easier. Finding the most accurate estimates of the population parameter of interest is the goal. Several estimators are discussed that use only one auxiliary variable. According to their research, the m12 estimator is the second most efficient for any sample size, and the m36 estimator is the third most efficient. m17, which ranks fifth, is a strong performer with a sample size of 30. It is determined that the most efficient estimator for samples of 40, 50, 60, 100, and 200 is m38. Both the correlation coefficient and the regression-cum ratio are incorporated into this estimator. Finally, the paper finishes with some guidelines for picking a reliable ratio estimator that takes into account the specifics of the target population.
Yan and Tian (2010) proposed ratio type estimator for population mean by using known skewness coefficient of auxiliary variable ratio estimator was obtained by using expression of MSE up to the 1ST order of approximation. The efficiency between some known estimators was compared with proposed estimator and conclusion was supported with original data by using an application. Naeem Shahzad et al., (2021). This paper discusses the use of a ratio estimator with a simple random sample to get the population mean of the research variable. The authors attempted to make a comprehensive comparison of the available ratio estimators and proposed around forty-two ratio estimators that vary in application throughout time and methodology. The mean square errors of each estimator are provided. They employ a simulation study with varying cases of mean equal to variance to draw comparisons. Finding reliable estimators of a parameter in an unknown population is the crux of the problem. Several estimators that rely on a single auxiliary variable are discussed. The fourth efficient estimator, m16, combines the traditional ratio estimator with the simple mean and works well with big samples. For a sample size of 30, m15 ranks fifth. The most effective estimator is found to be m38 for samples of 40, 50, and 60. The correlation coefficient and the regression-cum ratio estimator are two methods that can be used to generate these estimates.
2051
Enang, E. et al., (2014). the paper proposes a new ratio estimator based on quartile deviation, kurtosis coefficient, and tri-mean as auxiliary variables. It derives the bias and MSE of the proposed estimator and compares it with some existing estimators using numerical examples. It also suggests some conditions for choosing an appropriate auxiliary variable.
Singh (2011) extensively discussed the use of auxiliary information for enhancing estimators in sample surveys. In this light, the product and ratio methods of estimate stand out as particularly applicable. When the auxiliary and study variables had a strong positive or negative association, it was especially challenging to make improvements using these methods. By many authors considerable attempts were done in recent years to improve depth of the product methods of estimation and conventional ratio by using proper transformation on auxiliary variable X with the help of some known parameters of the auxiliary variable cx coefficient of variation, ax (S.D) B2 (x) coefficient of kurtosis,p correlation coefficient between the study variable y and the auxiliary variable x.
Subramani all and Kumarapadiya (2012) proposed a method for estimating a population median and kurtosis coefficient using a linear combination of data from the sample population. They evaluated the proposed estimator to previously established modified ratio estimators and obtained its bias and MSE. They also determined the parameters under which the proposed estimators outperform the standard modified ratio estimators.
Tailor, Tailor,Parma and Kumar (2012) proposed that it was significant to use the auxiliary information for the improvement of efficiency of estimators of unknown population parameters. The dual to ratio-cum-product estimator for population mean was used to know the parameters of auxiliary variable. In this paper bias and MSE expression were also obtained at high degree of approximation. Also compared empirically and theatrically the suggested parameters estimators. It was also proposed that dual to ratio and product estimators to find out population mean. While estimating the population mean of the research variable, Yadav and Kadilar (2013) considered several estimators and suggested a generic family of estimators that relied on the known value of particular populations' parameters on the auxiliary variable. In this paper, the authors introduced a new class of estimators for the mean of a population. The current family of estimators is a big step up from what came before. Numerical evidence also supported this conclusion. Lu and yan (2013) suggested that the efficiency of the estimators may be improved through the use of auxiliary information. The MSE equation for the proposed estimator class was calculated using two auxiliary variables. When estimating with a positive ratio, they made use of the relationship between the auxiliary and research variables. When a negative correlation was found, the product method of estimation was chosen. There was some thought given to ratio estimators with two auxiliary variables based on a finite number mean for the variable of interest.
When the auxiliary variable was known, Subramani (2013) investigated the revised study variable for estimate using the modified ratio estimator. For a set population, we calculated the mean squared error (MSE) of the suggested estimator and the biassed version, and we compared them to the standard modified ratio estimator.To add, we derived the conditions under which the suggested estimator outperformed the existing modified ratio approximation for a certain known population. That was demonstrated mathematically.
Yavav, Mishra (2014) and Shukla suggested a linear combination of two efficient estimators of the population mean as a median auxiliary variable. The proposed estimator of the population mean of the research variable was compared to those previously mentioned in the literature. The efficiency of estimation methods such the mean, variance, kurtosis, variation, skewness median, etc., was enhanced by the usage of these parameters.
Subramani and Prabavathy (2014) proposes a method for estimating the mean of a finite population by using a median-based modified ratio estimator. For several natural populations, we calculated the mean squared error (MSE) and bias of the suggested estimators as well as contrasting them to those of the ratio estimator, the regression model estimator, the median base ratio estimator, and the sample mean. Comparisons to other estimators, such as those based on linear regression, showed that the recommended median base modified ratio estimator outperformed them all.
2052
Kosgey, Sheryl Chebet, and Leo Odongo (2022) proposes knowledge on the minimum and maximum values of auxiliary data, the article discusses some successful estimators for determining the mean of a limited population. This article uses simulated data sets to compare the suggested estimators to various already known estimators and calculates their bias and mean squared error. The results reveal that under specific conditions, some of the proposed estimators perform better than others in terms of bias and mean squared error.
Cem Kadilar and Hulya Cingi (2004) proposes some new ratio estimators that use the information on the coefficient of kurtosis and the coefficient of variation of the auxiliary variable. The paper shows that these estimators are more efficient than the classical ratio estimator and some other existing estimators. The paper also provides some numerical examples to illustrate the results. Evrim Oral and Cem Kadilar (2011) introduces some robust ratio-type estimators that use modified maximum likelihood estimators (MMLEs) to deal with non-normality and outliers in the data. The article derives the bias and mean-squared error (MSE) of these estimators and compares them with other Kadilar-Cingi estimators. The article also conducts some simulations under various superpopulation models to demonstrate the robustness properties of these estimators. Subzar A et al., (2018) proposes some new ratio estimators that use the information on the population proportion possessing a certain attribute. The paper derives the bias and MSE of these estimators and compares them with other ratio estimators. The paper also gives some numerical examples using empirical data to support the findings.
Cem Kadilar, Hulya Cingi, and N. Balakrishnan (2008). The article adapts the ratio estimation using ranked set sampling, a technique that improves the efficiency of sampling by ranking the units before selecting them. The article uses a ratio estimator based on Prasad (1989) in simple random sampling and extends it to ranked set sampling. The article derives the bias and MSE of this estimator and compares it with other ratio estimators using ranked set sampling. The article also provides some numerical results to show the advantages of this estimator.
For the mean of a study variable y in a finite population, Enang, Akpan, and Ekpenyong (2014) proposed an alternative ratio estimator using data on the mean of an auxiliary variable X that was highly correlated with Y. They then drew conclusions about the bias and MSE of the proposed estimators. Both the numerical and analytical analyses demonstrated that some of the already used estimators were inefficient compared to the proposed alternative estimators. The small differences between the estimator and the alternative estimators for all populations examined suggested that the estimator was superior to the alternatives as a regression estimator.
Ekpenyong et al., (2015) proposed that a ratio estimator that takes into account the departure from the quartile, the kurtosis coefficient, a non-standard measure (the Tri-mean), and the sample size. The research uses simulated data sets to evaluate the proposed estimator to some already known ratio estimators and determines its bias and mean square error. In some cases, the suggested estimator outperforms other ratio estimators in terms of bias and mean square error. In order to improve the estimators' performance.
Lu and Yan (2014) proposed using the auxiliary data. For the purpose of estimating the population mean and the research variable, this data was often input into a regression, product, or ratio type estimator. The positive ratio technique of estimation was employed to determine the degree of association between the auxiliary variable and the research variable. When the correlation was -ve, however, the product method of estimation was chosen. Regression type estimators and ratio product estimators have both been studied with the use of the auxiliary variable.
METHODOLOGY
In this section, we will examine estimators for the population mean that are being studied for comparison purposes. These estimators utilize a single auxiliary variable in the context of simple random sampling. we will provide the mean square error associated with each estimator. Cochran's (1940) standard ratio estimator for use with a random sample is presented below.
J -m. = —X
x
2053
For the above estimator the mean square error is presented below
MES(in1) = 1-L (R2 S2 - 2RSx
+ S2), Where,R = -
- J x
Kadilar and Cingi (2004) introduced a population mean estimator, drawing upon the approach outlined by Ray and Singh (1981 ).The estimator is expressed below
m, =
y + byx{X - x)
X
For the above estimator the mean square error given below
MSE(in6) = ^ [R2S2 + S2(1- p2)], where byx =
n -x
Swain (2014) introduced an alternative exponential estimator with a ratio-type approach. characterized by its unique ratio-type methodology. This innovative estimator represents a departure from conventional estimation techniques and was designed to offer an alternative and potentially improved approach to solving specific statistical or data analysis problems.
moo =
yl! The MSE
X\1/2
MSE(m33)= 1-f72[c2y + c-f- pCycx].
Abdullahi and Yahaya (2017) introduced a new estimator for a specific statistical problem. They were influenced by the pioneering work of Housila and Neha-Agnihotri (2008) and built upon their insights to create an original approach.
m43=2
K 1+K
XCx + M
x^ + M,
+ \1+-
K\ /XCx + Md\ Q) \KCx + Md/
For the above estimator the mean square error presented below
MSE(in43)=(1-f)Sl(1-p2) Noor-ul-Amin (2019)
m,
52
7
= JX
7
MSE (m.52)= ^-f^ [R2S2 - 2RSxy + S$]
where
Zv = Ay + (1 - A)Zy-1 Zx = AX + (1 — A)Zx-1
= 10, = 8, ol = 2, a2 = 3, aXy = 1 , pXy = 0.50
FINDINGS AND DISCUSSIONS
Simulation Study
The research rigorously evaluates these estimators through a simulation study, drawing 50,000 samples of varying sizes from simple random sampling schemes. This empirical approach provides valuable insights into estimator performance under real-world conditions, enhancing the credibility of the findings.
We conducted a simulation study to compare the performance of different ratio estimators in terms of mean squared error (MSE) and relative efficiency (RE). We used simple random sampling to select 50,000 samples of various sizes (n=30,40,50,60,100 and 200) from sampling schemes of size 5000 with specified parameters. We calculated the MSE of each estimator using the formula
MSE (8) =
1
50,000
50,000
S)2
where § is the ratio estimator and 9 is the true value of the parameter. We also computed the RE of each estimator relative to the sample mean estimator using the formula
i=i
2054
MSE(v) MSE(0)
where ;y is the sample mean estimator and the results of the simulation study are presented in Tables 1 for MSE values and Tables 2 and 3 for RE values.
Table 1: MSE for sample size 30,40,50,60,100 and 200
Estimators n=30 n=40 n=50 n=60 n=100 n=200
m.! 0.42321 0.30605 0.23792 0.19587 0.10907 0.04780
m2 0.36004 0.26443 0.20757 0.17157 0.09667 0.04274
m3 0.34143 0.25179 0.19821 0.16392 0.09271 0.04110
m4 0.39136 0.28533 0.22292 0.18393 0.10303 0.04534
m5 0.35596 0.26168 0.20554 0.16991 0.09582 0.04238
m6 0.45615 0.32919 0.25550 0.21046 0.11697 0.05117
m7 0.38033 0.27908 0.21888 0.18114 0.10193 0.04500
m8 0.35694 0.26309 0.20695 0.17141 0.09683 0.04287
iSlg 0.41833 0.30454 0.23764 0.19626 0.10975 0.04822
mio 0.37528 0.27565 0.21633 0.17907 0.10085 0.04455
inn 29.57038 16.15186 9.70555 7.18210 2.26706 0.46522
m12 0.10580 0.07651 0.05948 0.04897 0.02727 0.01195
m13 0.90453 0.60900 0.45382 0.36502 0.19389 0.08215
m14 0.37142 0.27208 0.21321 0.17613 0.09903 0.04371
mis 0.41964 0.30376 0.23627 0.19456 0.10841 0.04753
m16 0.31354 0.23251 0.18408 0.15192 0.08652 0.03861
m17 0.30578 0.22744 0.18039 0.14942 0.08552 0.03829
m18 0.42366 0.30635 0.23813 0.19603 0.10915 0.04783
m19 0.31347 0.23266 0.18438 0.15213 0.08673 0.03874
m20 0.42337 0.30616 0.23799 0.19592 0.10910 0.04781
m21 0.42350 0.30624 0.23805 0.19597 0.10912 0.04782
m22 0.35942 0.26401 0.20726 0.17132 0.09654 0.04268
m23 0.32331 0.23935 0.18898 0.15627 0.08873 0.03947
m24 0.33198 0.24532 0.19341 0.15996 0.09065 0.04025
m25 0.34473 0.25405 0.19989 0.16530 0.09342 0.04140
m26 0.34459 0.25454 0.20053 0.16613 0.09405 0.04171
m27 0.36117 0.26600 0.20913 0.17320 0.09777 0.04326
m28 0.32932 0.24349 0.19205 0.15883 0.09006 0.04001
m29 0.34854 0.25664 0.20181 0.16687 0.09424 0.04173
m30 0.33292 0.24596 0.19389 0.16036 0.09085 0.04034
m31 0.31342 0.23263 0.18435 0.15211 0.08671 0.03873
m32 0.31351 0.23270 0.18441 0.15215 0.08674 0.03874
m33 0.33560 0.24729 0.19464 0.16079 0.09094 0.04033
2055
™34 0.31269 0.23199 0.18376 0.15173 0.08653 0.03867
m3s 0.45161 0.32445 0.25031 0.20499 0.11239 0.04826
m36 0.13247 0.09399 0.07236 0.05887 0.03316 0.01452
m37 0.13253 0.09402 0.07237 0.05888 0.03317 0.01453
m3a 0.37969 0.27658 0.21664 0.17638 0.09922 0.04406
m39 0.54133 0.38008 0.28846 0.23385 0.12492 0.05208
m40 0.38482 0.28213 0.22113 0.18298 0.10289 0.04539
m41 0.32297 0.23939 0.18913 0.15661 0.08900 0.03959
0.41098 0.29968 0.23407 0.19341 0.10828 0.04762
m*3 0.31493 0.23315 0.18442 0.15209 0.08650 0.03856
m44 0.31581 0.23363 0.18471 0.15229 0.08657 0.03857
m*s 0.51831 0.38322 0.30292 0.24539 0.14054 0.05903
m46 0.45193 0.32648 0.25355 0.20892 0.11619 0.05085
m41 0.45345 0.32746 0.25426 0.20947 0.11647 0.05096
m4a 0.31367 0.23278 0.18419 0.15236 0.08674 0.03866
m49 0.45463 0.32822 0.25480 0.20990 0.11669 0.05105
™so 0.31232 0.23182 0.18348 0.15173 0.08641 0.03852
0.38482 0.28213 0.22113 0.18298 0.10289 0.04539
™S2 0.01006 0.00733 0.00573 0.00464 0.00285 0.00116
Table 2: RE for sample size 30,40,50,60,100 and 200
Estimators n=30 n=40 n=50 n=60 n=100 n=200
m-! 1.00000 1.00000 1.00000 1.00000 1.00000 1.00000
m2 1.17545 1.15740 1.14622 1.14163 1.12827 1.11839
m3 1.23952 1.21550 1.20034 1.19491 1.17646 1.16302
in4 1.08138 1.07262 1.06729 1.06492 1.05862 1.05426
1.18893 1.16956 1.15754 1.15279 1.13828 1.12789
0.92779 0.92971 0.93119 0.93068 0.93246 0.93414
m7 1.11274 1.09664 1.08699 1.08132 1.07005 1.06222
ma 1.18566 1.16329 1.14965 1.14270 1.12641 1.11500
ilig 1.01167 1.00496 1.00118 0.99801 0.99380 0.99129
m10 1.12772 1.11029 1.09980 1.09382 1.08151 1.07295
mu 0.01431 0.01895 0.02451 0.02727 0.04811 0.10275
™i2 4.00010 4.00013 4.00000 3.99980 3.99963 4.00000
mi3 0.46788 0.50255 0.52426 0.53660 0.56254 0.58186
m14 1.13944 1.12485 1.11590 1.11208 1.10138 1.09357
2056
mis 1.00851 1.00754 1.00698 1.00673 1.00609 1.00568
mie 1.34978 1.31629 1.29248 1.28930 1.26063 1.23802
mi7 1.38403 1.34563 1.31892 1.31087 1.27537 1.24837
mia 0.99894 0.99902 0.99912 0.99918 0.99927 0.99937
mig 1.35008 1.31544 1.29038 1.28752 1.25758 1.23387
m20 0.99962 0.99964 0.99971 0.99974 0.99973 0.99979
m2i 0.99932 0.99938 0.99945 0.99949 0.99954 0.99958
m22 1.17748 1.15924 1.14793 1.14330 1.12979 1.11996
m23 1.30899 1.27867 1.25897 1.25341 1.22924 1.21105
m24 1.27481 1.24755 1.23013 1.22449 1.20320 1.18758
m25 1.22766 1.20468 1.19026 1.18494 1.16752 1.15459
m26 1.22816 1.20237 1.18646 1.17902 1.15970 1.14601
m27 1.17178 1.15056 1.13767 1.13089 1.11558 1.10495
m28 1.28510 1.25693 1.23884 1.23321 1.21108 1.19470
m29 1.21424 1.19253 1.17893 1.17379 1.15736 1.14546
m30 1.27121 1.24431 1.22709 1.22144 1.20055 1.18493
m3i 1.35030 1.31561 1.29059 1.28769 1.25787 1.23419
m32 1.34991 1.31521 1.29017 1.28735 1.25744 1.23387
m33 1.26106 1.23762 1.22236 1.21817 1.19936 1.18522
m34 1.35345 1.31924 1.29473 1.29091 1.26049 1.23610
m35 0.93711 0.94329 0.95050 0.95551 0.97046 0.99047
m36 3.19476 3.25620 3.28800 3.32716 3.28920 3.29201
m37 3.19332 3.25516 3.28755 3.32660 3.28821 3.28975
m38 1.11462 1.10655 1.09823 1.11050 1.09927 1.08488
m39 0.78180 0.80523 0.82479 0.83759 0.87312 0.91782
m40 1.09976 1.08478 1.07593 1.07045 1.06006 1.05310
m4i 1.31037 1.27846 1.25797 1.25069 1.22551 1.20738
m42 1.02976 1.02126 1.01645 1.01272 1.00730 1.00378
m43 1.34382 1.31267 1.29010 1.28786 1.26093 1.23963
m44 1.34008 1.30998 1.28807 1.28617 1.25991 1.23931
m45 0.81652 0.79863 0.78542 0.79820 0.77608 0.80976
m46 0.93645 0.93742 0.93836 0.93754 0.93872 0.94002
m47 0.93331 0.93462 0.93574 0.93507 0.93646 0.93799
2057
m48 1.34922 1.31476 1.29171 1.28557 1.25744 1.23642
m49 0.93089 0.93245 0.93375 0.93316 0.93470 0.93634
m50 1.35505 1.32021 1.29671 1.29091 1.26224 1.24091
m5i 1.09976 1.08478 1.07593 1.07045 1.06006 1.05310
m52 42.06859 41.75307 41.52182 42.21336 38.27018 41.20690
Table 3: Relative Position of Ratio Estimators
Estimators Ranking n=30 n=40 n=50 n=60 n=100 n=200
1 m52 in52 iff52 m52 iff52 iff52
2 râi2 râi2 iffi2 iffi2 iffi2 üff12
3 m36 in36 iff 36 iff 36 m36 iff 36
4 m37 in37 iff 37 iff 37 iff 37 iff 37
5 mi7 in17 nfl7 iff17 iff 17 lff17
6 m50 inso iffso iffso iffso iffso
7 m34 m34 in34 iffso iff43 iff43
8 m3i inie iffie iff 16 iff 16 lff44
9 m« in3i m48 m43 m34 iff 16
10 m32 in19 iff 31 iffsi iff44 m48
11 mie in32 ini9 mi9 iffsi iff34
12 m48 m48 iff 32 iff 32 iff 19 iffsi
13 m43 in43 in43 iff44 m32 iff19
14 m44 in44 m44 iff48 iff48 iff 32
15 m4i in23 in23 in23 in23 m23
16 m23 in41 in4l m4i in4i in4i
17 m28 m28 in28 in28 in28 in28
18 m24 in24 iff24 iff24 m24 in24
19 m3o in30 inso in30 inso iff 33
20 m33 in33 m33 in33 in33 iffso
21 m3 in3 Jff3 m3 Jff3 m3
22 m26 m25 in25 in25 in2s iff2s
23 m2s in26 in26 in26 m26 in26
24 m29 in29 m29 in29 in29 in29
25 m5 in5 ins ins ins ins
26 m8 in8 iff8 m22 in22 m22
27 m22 m22 in22 iff 8 iff2 iff2
28 m2 in2 iff2 iff2 m8 iff 8
29 m27 in27 m27 in27 in27 in27
30 m14 in14 ini4 in14 in14 m14
31 mio in10 iffio m38 iff 38 iffss
32 in38 m38 in38 inio inio inio
33 rfl7 ifi7 lfl7 ifi7 m7 iff7
34 in40 in40 m40 Jff40 Jff40 iff4
35 m5i iffsi iffsi iffsi iffsi m4o
36 in4 iff4 iff4 m4 iff4 insi
37 in42 m42 iff42 iff42 iff42 inis
2058
38 Hlg mis m15 m15 m15 m42
39 m15 mg mg m-! m-! ni1
40 in1 m-! m-! m20 m20 m20
41 m20 m20 m20 m21 m2i m21
42 m21 m2i m2i m18 mis mia
43 m18 m18 m18 ilig Hlg Rig
44 m3Z m35 m35 m35 m35 m35
45 m46 m46 m46 m46 m46 m46
46 m47 m47 m47 m47 m47 m47
47 m49 m4g m4g m4g m4g m4g
48 m6 m6 m6 m6 m6 m6
49 m45 m3g m3g m3g m3g m3g
50 m3g m45 m45 m45 m45 m45
51 m13 m13 m13 m13 m13 m13
52 ma mu mu mu mu ran
The table, which presents a comparison of Mean Square Error (MSE) and Relative Efficiency (RE), clearly demonstrates the superior performance of the recently developed memory-based ratio estimator m52 by Noor-ul-Amin in 2021. This estimator consistently outperforms the others in terms of both MSE and RE across various sample sizes. Following closely as the second most efficient estimator across all sample sizes is m12, which was originally introduced by Quenouille in 1956. It is noteworthy that m12 maintains a consistently high level of efficiency, demonstrating its robustness and reliability across different sample sizes within our study. These findings underscore the effectiveness and reliability of these two estimators, with m52, emerging as a particularly noteworthy advancement in the field, showcasing its potential for improved accuracy and efficiency in various statistical applications.
This estimator was modified by Durban (1959). In this estimator author partitioned the data into two equal parts and make use of mean of both parts in the estimators for study and auxiliary variable. The third most efficient estimator for this particular scenario is m36, which was developed by Subrammani and Prabavathy in 2014. In this estimator, the author incorporates the median of the study and an auxiliary variable. It's worth noting that all three of these estimators are considered efficient for all sample sizes in the study.
Naeem Shahzad and Abida (2022) use simple random sampling to collect data from a population with two characteristics, x and y, where the mean of y is greater than its variance. They compare various existing ratio estimators and propose 52 new ratio estimators based on different approaches. They conduct a simulation study using different scenarios and evaluate the mean square error (MSE) of the estimators to identify the best ones. In order to enhance the performance of some estimators, they additionally employ the correlation coefficient and regression-cum-ratio estimator techniques. In the ranking, the fourth most efficient ratio estimator is m37, which was developed by Chakrabarty in 1979. This estimator combines elements of the classical ratio estimator with a simple mean approach. It is noteworthy that m37, is considered efficient for all sample sizes. The fifth-ranking estimator is m17, which also proves to be efficient for all sample sizes. This estimator was developed by Kadilar and Cingi (2006), which is the combination of his already proposed estimators by Kadilar and Cingi (2004) and given in m6 and m7.
Estimators m16, m19, m31, m34, m43, m44 and m50 estimators also comes in the orbit of top most 10 ranking of efficient estimators for case two. These estimators were derived by Chakrabarty (1979), Yan and Tian (2010), Subramani and Kumarpandiyan (2012), Jerajuddin and Kishun (2016), Abdullahi and Yahaya (2017) , Tailor and Sharma (2009) and Kadilar and Cingi (2006) respectively. The use of these estimators, which incorporate sample size, coefficient of variation, kurtosis measure, and correlation coefficient of auxiliary variables, suggests that incorporating these measures in the construction process can be beneficial across all sample sizes.
2059
The ratio estimator with the poorest performance is referred to as m11; which was developed by Goodman and Hartley in 1958 and this estimator incorporates sample size, population size, and the correlation coefficient in its construction. Unfortunately, rn11 exhibits the worst performance across all sample sizes. The second poorest performing estimator, in terms of mean squared error (MSE) and relative efficiency (RE), is m13, which was introduced by Kadilar and Cingi in 2003. m13 relies on the square of the mean of an auxiliary variable for both sample and population data. Regrettably, the performance of m13 also ranks poorly for all sample sizes.
CONCLUSION
In this paper, we propose a new ratio estimator that is more efficient than the conventional ratio estimator when the population variance is less than the population mean in simple random sampling. The proposed estimator is based on the ratio of the sample mean and the sample median of the variable of interest, and it incorporates the information on the population size and the sample size. We derive the bias and the mean squared error of the proposed estimator, and compare its performance with some existing ratio estimators using simulated and real data sets. The results show that the proposed estimator has smaller bias and mean squared error than the conventional ratio estimator and some other ratio estimators under certain conditions. The study identifies several ratio estimators, including Estimators m16, m19, m31, m34, m43, m44 and m50which consistently rank among the top 10 most efficient estimators across different sample sizes and scenarios. These estimators take into account critical factors such as sample size, coefficient of variation, kurtosis measure, and the correlation coefficient of auxiliary variables. Their incorporation in the estimation process proves beneficial across a wide range of sample sizes. On the other hand, redeveloped by Goodman and Hartley in 1958, emerges as the poorest performing estimator across all sample sizes. This estimator incorporates sample size, population size, and the correlation coefficient in its construction. mn13 introduced by Kadilar and Cingi in 2003, ranks as the second poorest performing estimator, with suboptimal performance in terms of Mean Square Error (MSE) and Relative Efficiency (RE) across various sample sizes.
REFERENCES
[1] Abdullahi, U.K., & yahaya, A.(2017).A modified ratio-product estimator of population mean using some known parameters of the auxiliary variable. Bayero Journal of pure and applied Science, 10(1), 128-137. Ameriacan Journal of operational, 4(2),21-27.
[2] Al-Omari, A. I. (2012). Ratio estimation of the population mean using auxiliary information in simple random sampling and median ranked set sampling. Statistics & Probability Letters, 82(11), 1883-1890.
[3] Aslam, I., Noor-ul-Amin, M., Yasmeen, U., & Hanif, M. (2020). Memory type ratio and product estimators in stratified sampling. Journal of Reliability and Statistical Studies, 1-20.
[4] Baghel, S., & Yadav, S. K. (2020). Restructured class of estimators for population mean using an auxiliary variable under simple random sampling scheme. Journal of Applied Mathematics, Statistics and Informatics, 16(1), 61-75.
[5] Cochran, W.G.(1940). The estimation of the yields of cereal experiments by sampling for the ratio of grain to total produce. The journal of agricultural science, 30(2), 262-275.
[6] Ekpenyong, E. J., & Enang, E. I. (2015). Efficient exponential ratio estimator for estimating the population mean in simple random sampling. Hacettepe Journal of Mathematics and Statistics, 44(3), 689-705.
[7] Enang, E. I., Akpan, V. M., & Ekpenyong, E. J. (2014). Alternative ratio estimator of population mean in simple random sampling. Journal of Mathematics Research, 6(3), 54.
[8] Iftikhar, A., Shi, H., Hussain, S., Abbas, M., & Ullah, K. (2022). Efficient Estimators of Finite Population Mean Based on Extreme Values in Simple Random Sampling. Mathematical Problems in Engineering, 2022.
[9] Kadilar, C., &cingi ,H.(2004).ratio estimators in simple random sampling .Applied mathematics and computation, 151(3),893-902.
2060
[10] Kadilar, C., Unyazici, Y., & Cingi, H. (2009). Ratio estimator for the population mean using ranked set sampling. Statistical Papers, 50, 301-309.
[11] Kosgey, S. C., & Odongo, L. (2022). Ratio Estimator of Population Mean in Simple Random Sampling. American Journal of Theoretical and Applied Statistics, 11(6), 167-174.
[12] Lu, J., Stanek III, E. J., & Puleo, E. (2004). Estimating the Population Mean From a Simple Random Sample When Some Responses are Missing. manuscript in preparation (c04ed02. Doc.
[13] Lu, J. (2017). Efficient estimator of a finite population mean using two auxiliary variables and numerical application in agricultural, biomedical, and power engineering. Mathematical Problems in Engineering, 2017.
[14] LAWSON, N. (2017). New ratio estimators for estimating population mean in simple random sampling using a coefficient of variation, correlation coefficient and a regression coefficient. Gazi University Journal of Science, 30(4), 610-621.
[ 15] Oral, E., & Kadilar, C. (2011). Robust ratio-type estimators in simple random sampling. Journal of the Korean Statistical Society, 40(4), 457-467.
[16] Prashanth, B., & Naik, K. N. Restructured class of estimators for population mean using an auxiliary variable under simple random sampling scheme. Journal of Applied Mathematics, Statistics and Informatics, 17(2), 75-90.
[ 17] Sisodia, B.V.S., & Dwivedi, V.k. (1981). modified ratio estimator using coefficient of variation of auxiliary variable. Jouranl-indian society of agricultural statistic (1981)
[18] Srivastava, S.K., & Jhajj, H.S. (1981). A class of estimators of the population mean in survey sampling using auxiliary information. Biometrika, 68(1),341-343.
[19] Shahzad, N., Abida, A. A., & Khuram, H. R. (2021). Comparative Study of population Mean Using Ratio Estimator for equal Mean and Variance in Simple Random Sampling. Indian Journal of Economics and Business, 20(4),
[20] Shahzad, N & Abida (2022). Estimating Population Mean Using Ratio Estimator for variance is Greater than Mean in Simple Random Sampling. Indian Journal of Economics and Business, 21(1),
[21] Subzar, M., Maqbool, S., Raja, T. A., & Abid, M. (2018). Ratio estimators for estimating population mean in simple random sampling using auxiliary information. Applied Mathematics and Information Sciences Letters, 6(3), 123-130.
[22] Zamanzade, E., & Al-Omari, A. I. (2016). New ranked set sampling for estimating the population mean and variance. Hacettepe Journal of Mathematics and Statistics, 45(6), 1891-1905.
2061