1. Introduction
In survival analysis, problems are encountered in the analysis of clinical data because distributions proposed are not flexible enough to follow the movement of the data to give accurate results. In light of this, there is a need to develop a more flexible parametric model using Covid-19 data for example. In recent times, there was the outbreak of the third wave of Covid-19 pandemic called Delta Variant after the second wave generating a global outcry. Many researches/works have been done by several researchers since the breakup of the pandemic in December 2019 from various fields, such as Medicine, Statistics, Economics, etc., with different ideas, models, methods and approaches in their respective works. These include Badmus et al. (2020), Dey et al. (2020), WHO, (2020), Yoo, (2020) amongst others [1] [2] [3] [4] . Moharraza et al. [5] in their work developed a simple model to assess the spread of global Covid-19 pandemic and tested the validity of their model using the cases of the coronavirus disease compiled by the WHO. Meanwhile, the results from the analysis supported the validity of their developed model.
Zheng and Bobasera [6] compared and contrast the Spanish flu and Covid-19 pandemic using an approach called a “powerful approach” to quantify the degree of mean distance between them and reported according to their results that vaccines from the progress in science and technology have reduced the death probability in some countries. E.g., about 0.0001 United Kingdom about 0.001 in Italy, the United States, Canada, and San Marino considered in their work
The main goal of the extensions of the proposed model is to enhance the robustness and flexibility of the classical model. Also, motivating factors, including the model, are a family of beta generalized models in such that when equating one or more parameter(s) to one (1), the model becomes the baseline distribution. In addition, the model has capability to increase and decrease shapes due to heavily skewed and heavy tail probability distribution that plays an important role in modeling skewed data like clinical data.
Most clinical data are always skewed, thus a new distribution is constructed and generated from a parent distribution called Rayleigh Lomax (RL) distribution by Kawsar et al. [7] is generated using beta link function introduced by Jones [8] . This is expected to have different shapes for the survival and hazard rate functions. More parameters are added to the parent distribution, and the flexibility and capability of the distribution to model real life data are established. There is still room for further study from the topic such as “Log-Beta Raleigh Lomax model, Modified Raleigh Lomax distribution and mixture of Rayleigh, Exponentiated Rayleigh Lomax, Lehmann Type II Rayleigh Lomax, Lomax and exponential distribution in modeling real life phenomenon”.
The paper is arranged in the following order. Section 2 contains material and methods which involve the properties of the proposed Extended Rayleigh Lomax distribution. In Section 3, we obtain the moments and generating function. Section 4 includes an estimation of parameters, analysis of secondary data obtained from Covid-19 weekly report in Nigeria, results and discussion. Section 5 includes based on the findings.
2. Material and Methods
There are several methods in literature which have been used by many researchers. In this study, we consider beta logit function introduced by Jones [8] , which can jointly convolute two or more distributions.
2.1. Properties of Extended Rayleigh Lomax (ERL) Distribution
Density Function
The density function of the above distribution is obtained using the beta link function given as:
(1)
and
and
where, C(z) and c(z) are cdf and pdf of the parent distribution respectively, a and b are additional shape parameters to the parent distribution.
(2)
where, θ, λ and β are initial scale and shape parameters, while a and b are new parameters called shape, introduced to the distribution. Then (2) becomes extended Rayleigh Lomax (ERL) Distribution and it plots are shown in Figure 1.
Some new and existing distributions from the extended Rayleigh Lomax distribution are mentioned below:
1) When
in (2), we have the Lehmann Type II Rayleigh Lomax Distribution (New)
2) If
in (2), we get the pdf of exponentiated Rayleigh Lomax distribution (New)
3) When
in (2), this consists of beta Lomax distribution (New)
4) When
,
and
in (2), it then becomes beta Rayleigh distribution (New)
5) If
in (2), it yields Rayleigh Lomax distribution which is the parent distribution [7] .
6) When
in (2), this consists exponential Lomax distribution by El-Bassiouny et al. [9] .
7) If
in (2), it yields Rayleigh Lomax distribution which is the parent distribution [10] .
2.2. Distribution Function of BRL Distribution
The associative distribution function cdf in (2) is given as
![]()
Figure 1. Depicts the: (1) pdf, plot of density function; (b) cdf, plot of distribution function; (c) survival, plot of survival rate function; and (d) hazard function in plots of ERL distribution, plot of hazard rate function.
(3)
We set
(4)
Also
Putting dx in Equation (2), we realize:
(5)
and c in Equation (4) becomes
Equation (5) can be expressed as
(6)
Now, putting (2) in (6), we get
where,
and it is called the incomplete beta function.
(7)
(7) becomes the cumulative distribution function of ERL distribution.
2.3. The Survival Rate/Reliability Function
The reliability function of BRL distribution is given by
(8)
2.4. The Hazard Rate Function
(9)
2.5. The Reversed Hazard Rate Function
(10)
2.6. Testing the Trueness of the PDF of ERL Distribution
The ERL distribution is a probability density function with the use of:
(11)
Jones (2004) in his generalized beta distribution of first kind is given by:
where
and
, therefore differentiating (p) above, we obtain
Putting
, then differentiating M with respect to G
Therefore,
.
Hence, the
distribution has a true continuous probability density function.
3. Moments and Generating Function
In this section, we derive and obtain mgf of the distribution
and the general rth moment of a beta generated distribution defined by Hosking [11]
(12)
Cordeiro et al. [12] also discussed another mgf for generated beta distribution.
(13)
where,
then,
(14)
Putting pdf and cdf of the Extended Rayleigh Lomax distribution into Equation (14), we get
(15)
If
in Equation (14) that becomes the moment generating function of the baseline distribution.
Hence, the rth moment of the ERL distribution is obtained, since the moment generating function of the parent distribution is given by
(16)
Equation (16) can be re written as
(17)
and the rth moment of ERL distribution is obtained from Equation (17)
(18)
Letting
in (18) gives the rth moment of the baseline distribution by Kawsar et al. [7]
Other measures such as the Skewness (SKERLD) (p, q, β, λ, θ) and Kurtosis (KTERLD) (p, q, β, λ, θ) are also obtained below:
The rth moment of the ERL distribution is written as:
That is,
where,
I.e.
and
therefore,
(19)
where,
At the same time, the first four central moments
are obtained through (17) as:
Furthermore, the mean and second to fourth moments of the ERL distribution are given as follows:
,
,
, and
(20)
(21)
(22)
(23)
Other measures such as skewness, kurtosis and coefficient of variation of the ERL distribution are given below.
3.1. Skewness of the ERL Distribution
The skewness is a means of measuring non symmetry of the distribution. The skewness is given by:
(24)
3.2. Kurtosis of the ERL Distribution
The kurtosis is another measure that measures the peak of the distribution. The kurtosis of the BRL distribution is given as:
(25)
3.3. Coefficient of Variation of the ERL Distribution
This is also a measure of variability of a probability distribution. The CV of the ERL distribution is given as:
(26)
4. Estimation of Parameter
We made attempt to derive the maximum likelihood estimates (MLEs) of the ERL distribution parameters including: θ, λ, β, a and b which are scale and shape parameters. According to Cordeiro et al. [12] , the log likelihood function is given as:
(27)
and
are vectors
If c = 1, it becomes Equation (27) which leads to
(28)
and
have been stated at the beginning.
The log likelihood function of ERL distribution is given as:
(29)
Taking the differentiation in respect to p, q, θ, λ and β give the following:
(30)
(31)
(32)
(33)
(34)
4.1. Analysis of Data
The data used for the analysis is a secondary data obtained from Covid-19 situation weekly epidemiological report 39; 5th-11th July, 2021 (NCDC website state the website) [13] : Thirty-six (36) States including Federal Capital Territory (FCT) with reported laboratory-confirmed Covid-19 cases, recoveries, deaths, samples tested and active cases (37 data points); and was accessed on Thursday 22nd July, 2021 put date accesses at reference not here. Only the death cases from all states of the federation are used for the analysis.
4.2. Result and Discussion
The summary of goodness of fit statistics is used to check for normality of the data; skewness, kurtosis, Anderson Darling (AD), Kolmogorov Smirnov (KS) and Cramer-Von-Mises (CVM) shown in Table 1 with their values clearly indicate that the data does not follow normal distribution since p-values less than 5%, skewness greater than 0 (zero) and kurtosis also greater than 3 [14] [15] . While, graphs from Figure 2 show the nature of the data, the scatter, theoretical quantiles, boxplot, histogram, density and empirical cumulative distribution
![]()
Figure 2. The scatter, theoretical quantiles, boxplot, histogram, density and distribution plot.
function (ecdf) plot show the data is skewed. For instance, non-linearity by scatter and quantiles plots, outliers by boxplot and skewness by histogram and density plots. The minimum and maximum values in the data set are inclusive.
The results obtained in Table 2 are based on parameter estimates by method of maximum likelihood estimation (MLEs). The standard error values are in
![]()
Table 1. Summary of goodness of fit statistics of death cases data.
![]()
Table 2. Contains the MLE, standard error (in parenthesis) and model selection criteria.
bracket for all the models. The model ERLD is compared with other six models: ExpLD, LRLD, BRD, RLD, ExpRLD and BLD. Also, model selection criterion is performed on all models considered in the study. From the results, ERLD has the smallest values in all as we can see bold and starred where AIC = 2607.878, CAIC = 2609.813, HQIC = 2610.718 and BIC = 2615.933, which indicates that it is a robust and flexible model.
5. Conclusion
Despite the level of Nigerian Covid-19 death cases data set, the ERL distribution follows the movement of the data and has better representation of the data than any of the other existing distributions. The proposed distribution, being flexible and versatile, can accommodate increasing, decreasing, bathtub and unimodal shape hazard function. It is therefore useful and effective in the analysis of clinical and survival data.