A Comparison of Four Methods of Estimating the Scale Parameter for the Exponential Distribution
1. Introduction
The exponential distribution is one of the most commonly used continuous distributions in life data analysis. In particular, it is commonly used for systems exhibiting a constant failure rate. For a continuous random variable $X$, the probability density function (pdf) of the exponential distribution is given by
$$f(x;\lambda) = \lambda e^{-\lambda x}, \quad x \ge 0,\ \lambda > 0, \tag{1}$$
where $\lambda$ is the constant failure rate parameter. For more details on the exponential distribution and its applications, see [1] and [2].
Empirical Bayes estimators of the exponential distribution parameters have been introduced by several authors, including [3] and [4]. The use of the power function distribution as a conjugate prior for estimating the parameters of the exponential distribution was investigated in [5], while [6] used a gamma prior to assess Bayesian estimation of the exponential distribution, applying the maximum likelihood estimator and three different loss functions to estimate the parameters.
This paper studies the estimation of the exponential distribution parameter under a variety of loss functions and compares the resulting estimators using three criteria: the mean square error (MSE), the Akaike Information Criterion (AIC), and the Bayesian Information Criterion (BIC).
The remainder of this paper is organized as follows. In Section 2, the mathematical derivations of the estimation methods are presented, while the model selection criteria are explained in Section 3. The results of the Monte Carlo simulation study are used in Section 4 to compare the estimation methods based on the mean square error (MSE), the Akaike Information Criterion (AIC), and the Bayesian Information Criterion (BIC). A conclusion based on these findings is offered in Section 5.
2. Different Estimators of the λ Parameter
This section outlines the various estimates of the parameter of the exponential distribution, as given in (1).
2.1. Maximum Likelihood Estimator (MLE)
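The maximum likelihood estimator (MLE) is a technique for estimating the parameters of a given distribution, as discussed in [7], [8], and [9]. Suppose that $x_1, x_2, \ldots, x_n$ is a random sample from the exponential distribution shown in (1). The likelihood function of $\lambda$ can then be described as
$$L(\lambda) = \prod_{i=1}^{n} \lambda e^{-\lambda x_i} = \lambda^{n} \exp\left(-\lambda \sum_{i=1}^{n} x_i\right).$$
Taking the natural logarithm of both sides yields
$$\ln L(\lambda) = n \ln \lambda - \lambda \sum_{i=1}^{n} x_i,$$
and the MLE of $\lambda$ can thus be obtained by solving the following equation:
$$\frac{\partial \ln L(\lambda)}{\partial \lambda} = \frac{n}{\lambda} - \sum_{i=1}^{n} x_i = 0.$$
Hence, $n/\lambda = \sum_{i=1}^{n} x_i$, and the maximum likelihood estimator, $\hat{\lambda}_{MLE}$, is then given by
$$\hat{\lambda}_{MLE} = \frac{n}{\sum_{i=1}^{n} x_i} = \frac{1}{\bar{x}}. \tag{2}$$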
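As a brief illustration of (2), the following sketch computes the MLE from a simulated sample. The function name `exponential_mle`, the seed, and the rate value 2.0 are assumptions chosen for illustration only and do not come from the study.
```python
import random


def exponential_mle(sample):
    """Return the MLE of the exponential rate: n / sum(x_i) = 1 / mean."""
    if not sample or any(x < 0 for x in sample):
        raise ValueError("sample must be non-empty and non-negative")
    total = sum(sample)
    if total == 0:
        raise ValueError("sample sum must be positive")
    return len(sample) / total


random.seed(0)
true_rate = 2.0  # hypothetical value, for illustration only
# random.expovariate takes the rate (lambda), not the mean
sample = [random.expovariate(true_rate) for _ in range(1000)]
lam_hat = exponential_mle(sample)
print(f"MLE of lambda: {lam_hat:.4f}")
```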
2.2. Bayesian Estimator
This section demonstrates the process of deriving Bayesian estimates of the scale parameter for the exponential distribution. Three different loss functions are used to achieve this, which are the squared error loss function, the entropy loss function, and the composite LINEX loss function.
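The gamma distribution, $\text{Gamma}(\alpha, \beta)$, can be considered as a conjugate prior of $\lambda$, with its density function written in the form
$$g(\lambda) = \frac{\beta^{\alpha}}{\Gamma(\alpha)} \lambda^{\alpha-1} e^{-\beta\lambda}, \quad \lambda > 0,$$
where $\alpha > 0$ and $\beta > 0$ are the shape parameter and the rate (inverse scale) parameter, respectively.
The posterior density function of $\lambda$ for the given random sample $x_1, x_2, \ldots, x_n$ is thus obtained as
$$\pi(\lambda \mid x) = \frac{L(\lambda)\, g(\lambda)}{\int_0^{\infty} L(\lambda)\, g(\lambda)\, d\lambda} \propto \lambda^{n+\alpha-1} \exp\left[-\lambda\left(\beta + \sum_{i=1}^{n} x_i\right)\right].$$
This in turn implies that the posterior distribution can be written as
$$\pi(\lambda \mid x) = \frac{\left(\beta + \sum_{i=1}^{n} x_i\right)^{n+\alpha}}{\Gamma(n+\alpha)} \lambda^{n+\alpha-1} \exp\left[-\lambda\left(\beta + \sum_{i=1}^{n} x_i\right)\right],$$
which is a gamma distribution with parameters $n+\alpha$ and $\beta + \sum_{i=1}^{n} x_i$.
The three different loss functions used to develop a Bayes estimate for the parameter $\lambda$ are discussed below.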
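The conjugate update can be sketched in a few lines. The sketch assumes that the gamma prior is parameterised by a shape $\alpha$ and a rate $\beta$, so that the posterior has shape $n+\alpha$ and rate $\beta + \sum x_i$; the function name `gamma_posterior` and the sample values are hypothetical.
```python
def gamma_posterior(sample, alpha, beta):
    """Return (shape, rate) of the gamma posterior for the exponential rate.

    Assumes a Gamma(alpha, beta) prior with beta acting as a rate parameter.
    """
    if alpha <= 0 or beta <= 0:
        raise ValueError("alpha and beta must be positive")
    shape = alpha + len(sample)   # n + alpha
    rate = beta + sum(sample)     # beta + sum of observations
    return shape, rate


# Illustrative data only; the prior values (1, 2) mirror the simulation setting
post_shape, post_rate = gamma_posterior([0.25, 0.5, 0.25], alpha=1.0, beta=2.0)
print(post_shape, post_rate)
```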
2.2.1. Squared-Error Loss Function (SE)
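The squared-error (SE) loss function, as discussed in [10] and [11], can be defined as
$$L(\hat{\lambda}, \lambda) = (\hat{\lambda} - \lambda)^2.$$
The Bayes estimator of $\lambda$ under the squared-error loss function is the mean of the posterior density function. It is denoted as $\hat{\lambda}_{SE}$, which can be written as
$$\hat{\lambda}_{SE} = E_{\pi}(\lambda) = \int_0^{\infty} \lambda\, \pi(\lambda \mid x)\, d\lambda,$$
such that, for a gamma posterior with shape $n+\alpha$ and rate $\beta + \sum_{i=1}^{n} x_i$,
$$\hat{\lambda}_{SE} = \frac{n+\alpha}{\beta + \sum_{i=1}^{n} x_i}, \tag{3}$$
where $E_{\pi}(\cdot)$ indicates the posterior expectation.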
2.2.2. Entropy Loss Function
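The entropy loss function, as discussed in [12], can be written in the form
$$L(\hat{\lambda}, \lambda) \propto \frac{\hat{\lambda}}{\lambda} - \ln\left(\frac{\hat{\lambda}}{\lambda}\right) - 1.$$
The Bayes estimator of $\lambda$ based on the entropy loss function, denoted by $\hat{\lambda}_{BEN}$, is then given as
$$\hat{\lambda}_{BEN} = \left[E_{\pi}\left(\lambda^{-1}\right)\right]^{-1}.$$
From this, it is possible to derive $E_{\pi}(\lambda^{-1})$ for a gamma posterior with shape $n+\alpha$ and rate $\beta + \sum_{i=1}^{n} x_i$ as follows:
$$E_{\pi}\left(\lambda^{-1}\right) = \int_0^{\infty} \lambda^{-1}\, \pi(\lambda \mid x)\, d\lambda = \frac{\left(\beta + \sum_{i=1}^{n} x_i\right)^{n+\alpha}}{\Gamma(n+\alpha)} \cdot \frac{\Gamma(n+\alpha-1)}{\left(\beta + \sum_{i=1}^{n} x_i\right)^{n+\alpha-1}} = \frac{\beta + \sum_{i=1}^{n} x_i}{n+\alpha-1}.$$
The Bayes estimator of $\lambda$ based on the entropy loss function is therefore written as
$$\hat{\lambda}_{BEN} = \frac{n+\alpha-1}{\beta + \sum_{i=1}^{n} x_i}. \tag{4}$$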
2.2.3. Composite LINEX Loss Function
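The composite LINEX loss function is defined in [13] and is given as
$$L_c(\hat{\lambda}, \lambda) = e^{c(\hat{\lambda} - \lambda)} + e^{-c(\hat{\lambda} - \lambda)} - 2, \quad c > 0.$$
The Bayes estimator of $\lambda$ based on the composite LINEX loss function is denoted as $\hat{\lambda}_{BCL}$ and can be expressed as
$$\hat{\lambda}_{BCL} = \frac{1}{2c} \ln\left[\frac{E_{\pi}\left(e^{c\lambda}\right)}{E_{\pi}\left(e^{-c\lambda}\right)}\right].$$
To find $E_{\pi}(e^{c\lambda})$ for a gamma posterior with shape $n+\alpha$ and rate $\beta + \sum_{i=1}^{n} x_i$,
$$E_{\pi}\left(e^{c\lambda}\right) = \int_0^{\infty} e^{c\lambda}\, \pi(\lambda \mid x)\, d\lambda = \left(\frac{\beta + \sum_{i=1}^{n} x_i}{\beta + \sum_{i=1}^{n} x_i - c}\right)^{n+\alpha}.$$
Here, in a similar manner,
$$E_{\pi}\left(e^{-c\lambda}\right) = \left(\frac{\beta + \sum_{i=1}^{n} x_i}{\beta + \sum_{i=1}^{n} x_i + c}\right)^{n+\alpha},$$
where $0 < c < \beta + \sum_{i=1}^{n} x_i$.
The Bayes estimator of $\lambda$ based on the composite LINEX loss function can thus be expressed as
$$\hat{\lambda}_{BCL} = \frac{n+\alpha}{2c} \ln\left(\frac{\beta + \sum_{i=1}^{n} x_i + c}{\beta + \sum_{i=1}^{n} x_i - c}\right). \tag{5}$$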
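The three Bayes estimators can be evaluated together once the posterior parameters are known. The sketch below assumes a gamma posterior with shape $a = n+\alpha$ and rate $b = \beta + \sum x_i$, under which the estimators are $a/b$ (squared error), $(a-1)/b$ (entropy), and $\frac{a}{2c}\ln\frac{b+c}{b-c}$ (composite LINEX). The function name `bayes_estimates` and the sample values are hypothetical.
```python
import math


def bayes_estimates(sample, alpha, beta, c):
    """Bayes estimates of the exponential rate under three loss functions.

    Assumes a Gamma(alpha, beta) prior with beta as a rate parameter.
    """
    a = alpha + len(sample)   # posterior shape
    b = beta + sum(sample)    # posterior rate
    if a <= 1:
        raise ValueError("posterior shape must exceed 1 for the entropy estimate")
    if not 0 < c < b:
        raise ValueError("composite LINEX requires 0 < c < posterior rate")
    return {
        "SE": a / b,                                          # posterior mean
        "BEN": (a - 1) / b,                                   # 1 / E(1/lambda)
        "BCL": a / (2 * c) * math.log((b + c) / (b - c)),     # composite LINEX
    }


# Illustrative data; (alpha, beta, c) = (1, 2, 0.5) mirrors the simulation setting
est = bayes_estimates([0.25, 0.5, 0.25], alpha=1.0, beta=2.0, c=0.5)
for name, value in est.items():
    print(f"{name}: {value:.4f}")
```
For any admissible inputs the three values are ordered as BEN < SE < BCL, since $\ln\frac{b+c}{b-c} > \frac{2c}{b}$ whenever $0 < c < b$.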
3. Model Selection Criterion
To compare the efficiency of the MLE, SE, BEN, and BCL estimators, the mean square error, the Akaike information criterion, and the Bayesian information criterion were used.
3.1. Mean Square Error (MSE)
The mean square error is the mean squared distance between the estimated and true values of the parameter.
The MSE is thus calculated as
$$MSE(\hat{\lambda}) = \frac{1}{R} \sum_{i=1}^{R} \left(\hat{\lambda}_i - \lambda\right)^2, \tag{6}$$
where $\hat{\lambda}_i$ is the estimate of the parameter $\lambda$ on the ith run and R is the number of simulation runs.
Estimators with the lowest MSE are preferred, as this means that $\hat{\lambda}$ is closer to the actual value of $\lambda$.
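A minimal sketch of the MSE computation follows, assuming that the average is taken over the estimates obtained in repeated simulation runs. The function name `mean_square_error` and the numbers are illustrative only.
```python
def mean_square_error(estimates, true_value):
    """Average squared deviation of the run-wise estimates from the true value."""
    if not estimates:
        raise ValueError("estimates must be non-empty")
    return sum((e - true_value) ** 2 for e in estimates) / len(estimates)


# Three hypothetical run-wise estimates of a parameter whose true value is 2.0
mse_value = mean_square_error([1.9, 2.1, 2.0], true_value=2.0)
print(f"MSE: {mse_value:.6f}")
```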
3.2. Akaike Information Criterion (AIC)
The Akaike information criterion (AIC) is defined by the equation
$$AIC = -2 \ln L(\hat{\lambda}) + 2K, \tag{7}$$
where K is the number of estimated parameters and $L(\hat{\lambda})$ is the maximized value of the likelihood function.
Higher values of the likelihood function indicate a better fit and give a lower AIC; however, the second component, 2K, increases the AIC as more parameters are added.
For a small data set (small n relative to K), the second-order AIC, AICc, can be used more effectively. The AICc takes the form
$$AICc = AIC + \frac{2K(K+1)}{n-K-1}, \tag{8}$$
where n is the sample size and $\frac{2K(K+1)}{n-K-1}$ is the bias-correction factor. As n increases, the bias-correction factor tends to zero, and the AICc gives results that closely resemble the AIC.
3.3. Bayesian Information Criterion (BIC)
The Bayesian (or Schwarz) information criterion is expressed as
$$BIC = -2 \ln L(\hat{\lambda}) + K \ln(n), \tag{9}$$
where n is the sample size.
Further information on AIC and BIC can be found in [14] and similar references.
The method with the smallest values of MSE, AIC, and BIC can be taken to be the most efficient method of estimating the parameter of the exponential distribution, offering estimated values $\hat{\lambda}$ close to the true value. The best method of estimating the parameter can also be determined by finding which one offers the highest log-likelihood value.
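The criteria in this section can be sketched for the one-parameter exponential model (K = 1) as follows. The sketch assumes the standard forms $AIC = -2\ln L + 2K$, $AICc = AIC + 2K(K+1)/(n-K-1)$, and $BIC = -2\ln L + K\ln n$, and it evaluates the log-likelihood at whichever estimate is passed in; how the study applies the criteria to the Bayes estimates is an assumption here. The function names are hypothetical.
```python
import math


def exp_log_likelihood(sample, lam):
    """Log-likelihood of an exponential sample at rate lam: n*ln(lam) - lam*sum(x)."""
    if lam <= 0:
        raise ValueError("lam must be positive")
    return len(sample) * math.log(lam) - lam * sum(sample)


def information_criteria(sample, lam_hat, k=1):
    """Return the log-likelihood, AIC, AICc, and BIC at the estimate lam_hat."""
    n = len(sample)
    if n - k - 1 <= 0:
        raise ValueError("AICc requires n > k + 1")
    log_lik = exp_log_likelihood(sample, lam_hat)
    aic = -2.0 * log_lik + 2.0 * k
    aicc = aic + 2.0 * k * (k + 1) / (n - k - 1)   # small-sample correction
    bic = -2.0 * log_lik + k * math.log(n)
    return {"logL": log_lik, "AIC": aic, "AICc": aicc, "BIC": bic}


# Illustrative data; 0.5 is the MLE n / sum(x) for this sample
data = [1.0, 2.0, 3.0]
crit = information_criteria(data, lam_hat=0.5)
for name, value in crit.items():
    print(f"{name}: {value:.4f}")
```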
4. The Simulation Study
This section uses a Monte Carlo simulation to compare the MLE and the Bayes estimators under the SE, BEN, and BCL loss functions, computed as shown in equations (2) to (5) of Section 2, with respect to the MSE, AIC, and BIC criteria defined in (6), (7), and (9), respectively. Sample sizes n = 10, 30, 150, 300, and 1,000 were used, with data generated from the exponential distribution with parameter $\lambda$ and with arbitrarily chosen prior parameters (α, β, c) = (1, 2, 0.5). The number of replications used was 1,000,000 for each sample size.
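A scaled-down sketch of such a simulation is given below. It is not the code used in the study: the true rate 1.5, the seed, and the 2,000 replications are assumptions chosen to keep the run short, and the Bayes estimators assume a gamma prior with shape α and rate β. Only the MSE criterion is computed.
```python
import math
import random


def all_estimates(sample, alpha, beta, c):
    """MLE and Bayes estimates (SE, BEN, BCL) of the exponential rate."""
    n, s = len(sample), sum(sample)
    a, b = alpha + n, beta + s          # assumed gamma posterior shape and rate
    return {
        "MLE": n / s,
        "SE": a / b,
        "BEN": (a - 1) / b,
        "BCL": a / (2 * c) * math.log((b + c) / (b - c)),
    }


def simulate_mse(true_rate, n, reps, alpha=1.0, beta=2.0, c=0.5, seed=0):
    """Monte Carlo MSE of each estimator over `reps` samples of size n."""
    rng = random.Random(seed)
    sq_err = {name: 0.0 for name in ("MLE", "SE", "BEN", "BCL")}
    for _ in range(reps):
        sample = [rng.expovariate(true_rate) for _ in range(n)]
        for name, value in all_estimates(sample, alpha, beta, c).items():
            sq_err[name] += (value - true_rate) ** 2
    return {name: total / reps for name, total in sq_err.items()}


TRUE_RATE = 1.5   # hypothetical true value, not taken from the study
REPS = 2000       # far fewer than the 1,000,000 replications of the study
results = {n: simulate_mse(TRUE_RATE, n, REPS) for n in (10, 30, 150)}
for n, mse in results.items():
    row = ", ".join(f"{name}={value:.4f}" for name, value in mse.items())
    print(f"n={n}: {row}")
```
Increasing `REPS` and adding the remaining sample sizes would bring the sketch closer to the setting described above, at the cost of a longer run time.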
The efficiencies of the estimation methods are compared in Table 1 and Table 2. The MSE, AIC, and BIC results are presented in Table 1, while the 95% confidence intervals (CIs), along with the average confidence interval lengths (ALs) computed using the estimators, are displayed in Table 2.
Table 1. Estimated value of the rate parameter, $\hat{\lambda}$, for the true parameter $\lambda$, with mean square error (MSE), Akaike Information Criterion (AIC), Bayesian Information Criterion (BIC), and log-likelihood (L) when (α, β, c) = (1, 2, 0.5).
Table 2. The lower (L) and upper (U) limits and average length (AL) of the 95% confidence intervals for $\lambda$ of the exponential distribution.
Table 1, which shows the estimated values $\hat{\lambda}$ and the values of MSE, AIC, and BIC with log-likelihoods for the selected sample sizes, also reports the MSE values obtained using all four estimators. For small sample sizes (n = 10, 30), the BEN method performs better than the other estimation methods, as it gives the smallest MSE. With increasing sample size, however, the SE method provides the lowest MSE. The results further indicate that the lowest values of AIC and BIC are obtained using BCL as n increases further, although the values of AIC and BIC are close to each other for all estimation methods.
Overall, all methods perform well in estimating the parameter. The higher the log-likelihood value, the better the parameter estimation method, and the results suggest that the BCL method gives a higher log-likelihood value than all other approaches. The results in this paper agree with those in [8] in showing that the Bayesian method is better than the MLE. The MSE and L values of all methods decrease as the sample size increases, while the values of AIC and BIC increase.
For all estimation methods, the 95% confidence intervals and their average lengths were computed for the parameter. The narrow 95% intervals indicate that the parameter is estimated precisely. The results in Table 2 show that the AL values decrease as the sample size increases and become closer to each other across methods. The smallest ALs were obtained with the BEN method.
5. Conclusion
In this study, four methods of estimating the parameter of the exponential distribution were compared. The classical MLE was compared with Bayes estimators under three loss functions, using three criteria: the MSE, the AIC, and the BIC. The results indicate that the Bayesian method performs better than the maximum likelihood estimator for the estimation of the parameter, as it gives the smallest values of MSE, AIC, and BIC, with narrow 95% CIs and the shortest ALs.
Acknowledgements
Sincere thanks to the members of JAMP for their professional performance.