1. Introduction
Poisson regression model is widely used for modeling response variables that are counted. It is discussed by [1] . Practically, a common method used to estimate parameters is the maximum likelihood estimator (MLE). Unfortunately, this technique is high sensitivity to outliers in data, see ( [2] [3] ). To overcome this issue, many robust are an alternative to Maximum Likelihood Estimates. One of the first robust methods used to estimate the parameters in Poisson regression models is the Conditionally Unbiased Bounded Influence introduced by [4] . [5] discussed the M-estimator for Poisson regression model, these estimates belong to Mallows-type. [6] developed robust M-estimates for generalized linear models (GLM), these estimates are asymptotically normal and consistent. [7] proposed a fast and stable technique based on breakdown point of the trimmed maximum likelihood for generalized linear models. [8] introduced the class of M-estimators based on quasi likelihood estimators proposed by [9] . [10] developed a robust estimator for Poisson regression model based on Mallows quasi-likelihood estimator. [11] discussed the behaviour of maximum likelihood estimator in the present of outliers. [12] discussed a robust resistant estimator based on the misclassification model. [13] generalized Optimally Bounded Score Function discussed by [14] for linear models to the generalized linear model. More recently, [15] introduced a robust method for logistic regression models. [16] discussed the robust estimators for Poisson regression model with outliers.
In this paper, we introduced a robust method for Poisson regression by using the weight functions proposed by [17] these weight functions are based on Mallow’s type estimator, moreover, to evaluate the performance of the new methods with (MLE), Mallows, and (CUBIF) using the Monte Carlo simulation study. In Section 2, we discuss the Poisson regression model and the maximum likelihood estimators. In Section 3, we provide robust estimators for Poisson regression. In Section 4, we show the results of Monte Carlo simulation study. In Section 5, we offer the conclusions.
2. Poisson Regression Model and ML Estimator
Poisson regression is proper method to model a count data. Probability mass function is
(1)
where:
and
, that is mean the Poisson regression has equal mean and variance. Based on a sample
, the Poisson regression model in terms of the mean of response can be written as follow:
(2)
where
are disturbance terms. The relationship between the mean of the dependent variable and explanatory variable can be describe by use the log-link function:
(3)
where
is the explanatory variables and
is the parameters of regression. The popular method used to estimate parameters in Poisson regression models is the maximum likelihood estimation, the likelihood function of the response variables
is:
(4)
, with according to (1),
and
. We can write the total likelihood as follows:
We can define the MLE as:
. To get the estimates of maximum likelihood for this model, we can maximizing the likelihood function by differentiating it respect to
. While, maximizing the likelihood function has no closed form solution so may use the (Fisher Scoring) or the iteratively weighted least squares algorithm (IWLS) to get the maximum likelihood estimates see ( [1] [18]).
In this paper, we focus on maximum weighted likelihood estimators for Poisson regression model, the maximum weighted likelihood estimator is:
There
, with
,
, and
3. Robust Poisson Regression
In robust Poisson regression, Mallows-type estimator introduced by [4] can be applied to fit the data a count variables, this method minimizes the weighted log-likelihood function. [5] studied Mallows-type estimator deeply and introduced a robust method for generalized linear models. We can measure the leverage of observation x by used the following:
(5)
where
is the robust location estimator and
is the robcation estimator
and
, can be calculated by using minimum covariance determinate (MCD) method. We can get the Mallows-type estimator fust variance-covariance matrix of the predictor variables
. The robust scale and loor Poisson regression by solution the equations:
(6)
where
, w is a non increasing function such that
is bounded. [5] introduced choosing w depends on a constant
.
this estimate knows as Mallows-type estimator or weighted maximum likelihood estimator (WMLE).
In this paper, we introduced a robust methods for Poisson regression model they are based on maximum weighted likelihood estimators. The weight of this method depends on the function introduced by [17]. We first calculate the initial scatter and location estimators of predictor
and
respectively. then, compute the squared Mahalanobis distances of predictor which can be defined as:
The weight function we introducing can be defined as follows: first weight:
, where
indicate to squares Mahalanobis distances, then:
second weight:
, then, we can write in the form of:
Then, the maximum weighted likelihood estimators for Poisson regression model can be gained by a solution the following form:
(7)
For compute the maximum weighted likelihood estimators we used algorithm of Mallows-type introduced by [5] .
4. Evaluation of the Robust Methods
In order to test the performance of the above estimates, we conduced Monte Carlo simulation study for comparing the a new methods with the Maximum likelihood estimator (MLE), Mallows type estimator for [4] and [5] .
4.1. Monte Carlo Simulation Study
In this subsection, we examine the perform of the new robust methods (WMLEw1, WMLEw2) and compare with the maximum likelihood estimate (MLE), Mallows-type estimator (Mallows) of [5] and the conditionally unbiased bounded influence (CUBI) of [4] . The simulation study includes three models. First model is clean model, second model is 5% of data contaminated and third model is 10% of data are contaminated. In the three models, we generated the explanatory variables
from standard normal distribution
, and the victor of parameter is
, with four sample size
, these values were chose to represent moderate and large samples.
The response variables
are generated from poisson distribution with
with
. The outliers are distributed according to Poisson distribution with mean
: where IQR is the interquartile range. To examine the perform of these estimators, we compute the Bias and mean squared error (MSE) for the three models. For all scenarios, we run 1000 repetitions. However, a good estimator is the one has small Bias and MSE. Therefore, we compute the bias and MSE for each parameter as follows:
and
4.2. Results from the Monte Carlo Simulation Study
It is seen in Table 1 for clean model, the values of the Bias and MSE for maximum likelihood estimator (MLE), (Mallows) and (CUBIF) are smaller those of new weighted estimators (WMLE1), (WMLE2). We can conclude that the WMLE1 and WMLE2 estimators perform less compared to others in clean models. But when the 5% of data are contaminated in second scenario (Table 2) and 10% of data are contaminated in third scenario (Table 3), the weighted maximum likelihood estimators (WMLE1 and WMLE2) has lower bias and (MSE) compered with others methods, that is mean our new methods perform better compered with other estimators.
Table 1. Bias and MSE of estimators for clean model.
Table 2. Bias and MSE of estimators when 5% of data are contaminated.
Table 3. Bias and MSE of estimators when 10% of data are contaminated.
5. Conclusion
In this paper, we suggested new robust estimators for Poisson regression: the weighted maximum likelihood estimators (WMLE1 and WMLE2). To examine the performance of suggested estimators, we conducted a Monte Carlo simulation study to compare the suggested estimators with the classical maximum likelihood estimator (ML), Mallows and CUBIF. The result of simulation study shows that in Table 1 (clean model), the maximum likelihood estimator, CUBIF and Mallows perform close to each other, while the proposed estimators have lower performance compared to other estimators. In contaminated models (Table 2 and Table 3), the new weighted maximum likelihood estimators (WMLE1 and WMLE2) performs better compared with other estimators. The extent of new estimators proposed in this paper to other generalized linear models would be an interesting subject to follow.
Acknowledgements
We thank the editor and the referee for their comments.