A New Regression Type Estimator and Its Application in Survey Sampling ()
1. Introduction
The use of auxiliary information has increased widely in survey sampling to improve the precision of estimates. A lot of works have been done by many authors using auxiliary information. The usage of the transformed auxiliary variable in estimating finite population mean is presented in [1]. The use of two auxiliary variables in estimation has been introduced by [2]. A modified ratio estimator for the estimation of finite population mean when median of the auxiliary variable was available is shown in [3]. The modified estimator shows the significant role using of auxiliary information. The cum-dual ratio estimator was more efficient than other existing estimator proved by [4]. A new modified product estimator is better than other existing estimator when median of the auxiliary variable is known [5].
Regression estimator is important in estimation theory for its improved precision. It has been widely used in estimating population parameter. Regression estimator is more useful in many practical situations including business, economics, time series forecasting, agriculture etc. [6]. Before applying regression estimator, the regression parameter needs to be estimated. In this study, we proposed new regression type estimator where correlation coefficient is used instead of regression parameter. In many practical situations, population correlation coefficients are known [7].
The structure of the paper is as follows: Section 2 contains the concept of proposed estimator and its bias and mean square error. Section 3 shows the efficiency comparison of proposed estimator compared to that of ratio, product and simple random sampling mean estimator. Section 4 illustrates simulation study of the proposed estimator. Section 5 presents the real-life example of proposed estimator. Finally, section 6 provides the conclusion of our study.
2. Proposed Estimator
In this section we proposed our new estimator. For this purpose, we derived bias and mean square error of proposed estimator.
Consider a finite population
of size N units. Let y and x denote the study variable and auxiliary variable respectively. For estimating population mean
, a sample of size
is drawn using simple random sampling without replacement (SRSWOR) from the population U.
Where, the population mean is
.
Hansen, Hurwitz and Madow (1953) consider the linear regression estimator of the population mean
as
where,
an estimate of regression coefficient is
.
The new regression type estimator for estimating population mean
as
(1)
where,
is an estimate of correlation coefficient
.
To calculate bias and mean square error of new regression type estimator, we consider first degree approximation.
Let us define [6]
This implies that,
Theorem 1. The bias of the proposed estimator
of population mean
is given by
Proof:
From Equation (1) we can write,
The proposed estimator
, in terms of
and
can be easily written as
(2)
Using binomial expansion, we consider first order approximation and ignoring the higher order approximation. Hence, Equation (2) can be written as
(3)
After simplification, Equation (3) can be expressed as
(4)
Now, taking expectation of both sides of Equation (4) we get,
Simplifying this, we get the bias of the estimator
. Hence this is the theorem.
Theorem 2. The mean squared error of the proposed estimator
, to the first order approximation is
Proof:
Squaring both sides of Equation (3) and then taking expectation
By the definition of mean squared error (MSE), we have
Hence this is the theorem.
3. Efficiency Comparison
This section illustrates the conditions, for which the mean square error of newly proposed regression type estimator will be minimum compared to that of simple random sampling mean estimator, product estimator and ratio estimator.
Theorem 3. The proposed estimator
is more efficient than the product estimator
if
Proof:
The proposed estimator
is more efficient than product estimator
if
Now there are two cases:
Case 1. The inequality will be satisfied if
This condition holds in practice.
Case 2. The inequality will be satisfied if
This condition holds in practice.
Theorem 4. The estimator
is more efficient than the ratio estimator
if
Proof:
The proposed estimator
is more efficient than ratio estimator
if
Now there are two cases:
Case 1. The inequality will be satisfied if
This condition holds in practice.
Case 2. The inequality will be satisfied if
This condition holds in practice.
Theorem 5. The estimator
is more efficient than simple random sample mean estimator if
Proof:
The proposed estimator
is more efficient than simple random variance estimator
if
This condition holds in practice.
Theorem 6. The estimator
is more efficient than simple linear regression estimator
if
Proof:
The proposed estimator
is more efficient than simple linear regression estimator
if
This condition does not hold in practice.
4. Simulation Study
In this section, we will discuss the simulation study for the proposed estimator. For this purpose, we have to generate data from standard normal distribution. Steps for generating data from normal distribution and estimating relative efficiency of different estimator with simple random sampling estimator are given below:
Step 1: Choose a random sample of size n.
Step 2: Generate normal random variable with specified mean and variance.
Step 3: Calculate mean square error and relative efficiency of considered estimator.
Step 4: Repeat step1 to step3 until desired result is obtained.
Table 1 provides the estimated values of Mean Square Error and Relative Efficiency of proposed estimator and different estimators for simulated data.
The above information delineates that proposed estimator is more efficient than other estimator for simulating data.
5. Real Life Example
To illustrate the applicability of the proposed estimators over other estimators through real data sets, we used data in [6]. The data used in this study to illustrate the application of proposed estimator is a database for the amount of real and non-real estate farm loans in different states during 1997. The result, description and necessary data statistics of the populations are given as follows:
y = Amount (in $000) of real estate farm loans in different states during 1997
x = Amount (in $000) of non-real estate farm loans in different states during 1997
Table 2 shows the estimated values of the bias, mean square error and relative efficiency of proposed estimator and different estimator for real data.
Figure 1 shows the bias and mean square error of the proposed estimator is less than the bias and mean square error of the existing random sample estimator, product and ratio estimators. To examine the efficiency of the proposed estimator
over the estimator
,
and
, we calculated the percentage of relative efficiency of different estimator of
with respect to usual estimator
provided in Table 1 and Table 2 and Figure 2. The proposed estimator
performed better than other estimators.
Figure 1. Bias and mean square error of proposed and existing estimators.
Figure 2. Relative efficiency of proposed and existing estimators.
Table 1. Mean square error and relative efficiency comparison of proposed estimator and different estimator of
with respect to the usual estimator
for simulated data.
Table 2. Bias, mean square error and relative efficiency comparison of proposed estimator and different estimator of
with respect to the usual estimator
.
6. Conclusion
In this demonstration, we developed a new regression type estimator using the correlation coefficient Equation (1) and theoretically showed that the proposed estimator is more efficient than other usual ratio estimators, product estimator and simple random sampling mean estimator under some certain conditions. These theoretical conditions are also satisfied by the results of a numerical example.
Acknowledgements
The authors gratefully acknowledge the excellent research facility provided by Bangabandhu Sheikh Mujibur Rahman Science and Technology University. M. Zahid Hasan would like to specifically thank Ministry of Science and Technology Government of the People’s Republic of Bangladesh for research support through R & D.