Power Hamza Distribution with Application to Lifetime Data

Abstract

In this paper, a three-parameter lifetime distribution named power Hamza distribution (PH) is proposed. The PH distribution is a useful generalization of the Hamza distribution which accommodates heavy-tailed, upside-down bathtub and J-shaped hazard rates making it more flexible than the Hamza distribution for modelling various kinds of lifetime data. A comprehensive account of the properties of this distribution is presented. The maximum likelihood estimators of the unknown model parameters are discussed. Finally, a real-life data is analyzed for illustrative purpose proving that the PH outperforms the Hamza distribution and several other lifetime distributions.

Share and Cite:

Enogwe, S. , Nwankwo, C. and Oti, E. (2022) Power Hamza Distribution with Application to Lifetime Data. Journal of Applied Mathematics and Physics, 10, 31-48. doi: 10.4236/jamp.2022.101004.

1. Introduction

The Hamza distribution was introduced by [1] with the cumulative distribution function (cdf) and the corresponding probability density function (pdf) respectively given by

F H ( y ) = 1 [ 1 + θ x ( θ 5 y 5 + 6 θ 4 y 4 + 30 θ 3 y 3 + 120 θ 2 y 2 + 360 θ y + 720 ) ( α θ 5 + 120 ) ] e θ y (1)

and

f H ( x ) = θ 6 α θ 5 + 120 ( α + θ 6 y 6 ) e θ y (2)

for y > 0 , θ > 0 and α > 0 .

It may be observed that the Hamza distribution was obtained by compounding the exponential distribution having scale parameter θ and the gamma distribution having shape parameter 7 and scale parameter θ , with mixing proportions p = α θ 5 / ( α θ 5 + 120 ) and q = α θ 5 / ( α θ 5 + 120 ) such that p + q = 1 .

[1] studied the properties and applications of this distribution in the context of lifetime analysis, showing that the distribution is superior to Lindley distribution due to [2], Ishita distribution by [3] and Pranav distribution by [4], respectively.

The aim of this paper is to introduce a new distribution, called the power Hamza distribution, which is a direct generalization of the Hamza distribution. Some of the distributions proposed using the power transformation include the power Lindley due to [5], power Akash and Shanker proposed by [6] [7], power Ishita and power Aradhana due to [8] [9], power Rama and power Garima due to Abebe et al. [10] [11], power Pranav due to [12], power Sujatha by [13], power Prakaamy by [14]. From the literature reviewed in this paper, all the power transformed distributions were shown to be more flexible than their corresponding baseline distributions and more useful for analyzing complex data structures in various fields of life.

The rest of the paper is organized as follows. The pdf, cdf and hazard rate function of the new distribution is given in Section 2. Section 3 provides a comprehensive account of the properties of the distribution including the moment generating function, moments, skewness, kurtosis, mean residual lifetime, mean deviations, Bonferroni and Lorenz curves, stochastic ordering, entropy measure, stress-strength reliability, distributions and moments of order statistics. In Section 4, the maximum likelihood estimates of the parameters of the distribution are given. Also, Section 5 gives the asymptotic confidence intervals. Section 6 illustrates the proposed model in two real-datasets. The paper is concluded in Section 7.

2. The Power Hamza (PH) Distribution

The probability density function, cumulative distribution function and hazard function of the power Hamza distribution having parameters θ , α and β are provided in as Propositions 1, 2 and 3.

Proposition 1. A random variable X is said to have a PH distribution if its pdf is of the form

f P H ( x ) = β θ 6 α θ 5 + 120 ( α + θ 6 x 6 β ) x β 1 e θ x β ; x > 0 , θ > 0 , α > 0 , β > 0 (3)

Proof. Given the distribution of the Hamza random variable Y defined in (2). Assume that another random variable X is related to Y by the power function X = g ( Y ) = Y 1 / β . Then the distribution of X is the power Hamza distribution. To derive the distribution of X, we notice that X is a one-to-one function of Y and so, x = 0 when y = 0 and x = when y = , which implies that the support of the distribution of X is ( 0 , ) .

Letting x = g ( y ) = y 1 / β , and y = g 1 ( x ) = x β in (2), gives

f H ( g 1 ( x ) ) = θ 6 α θ 5 + 120 ( α + θ 6 x 6 β ) e θ x β , x > 0 (4)

According to [15], the probability density function of a continuous random variable X = g ( Y ) is gotten by

f ( x ) = f ( g 1 ( x ) ) | d y d x | (5)

Substituting (4) and d y / d x = β x β 1 into (5), one obtains

f P H ( x ) = θ 6 α θ 5 + 120 ( α + θ 6 x 6 β ) e θ x β × | β x β 1 | ; x > 0 (6)

Further simplification of (6) yields the pdf of the power Hamza random variable X defined in (3), and the proof of Proposition 1 is complete.

Corollary 1. Let X be a power Hamza random variable, then the function defined by Equation (3) is a pdf.

Proof. We show that f P H ( x ) 0 x and 0 f P H ( x ) d x = 1 are satisfied.

1) f P H ( x ) 0 x

2) 0 f P H ( x ) d x = 0 β θ 6 α θ 5 + 120 ( α + θ 6 x 6 β ) x β 1 e θ x β d x = β θ 6 α θ 5 + 120 [ α 0 x β 1 e θ x β d x + θ 6 0 x 7 β 1 e θ x β d x ]

By setting x = y 1 / β in the above integral and noting that 0 y α 1 e θ y d y = Γ ( α ) / θ α one obtains

0 f P H ( x ) d x = β θ 6 α θ 5 + 120 [ α β 0 e θ y d y + θ 6 β 0 y 6 e θ y d y ] = β θ 6 α θ 5 + 120 [ α β Γ ( 1 ) θ + θ 6 β Γ ( 7 ) θ 7 ] = β θ 6 α θ 5 + 120 [ α β θ + 120 β θ 6 ] = β θ 6 α θ 5 + 120 [ α θ 5 + 120 β θ 6 ] = 1

Henceforth, a random variable X that follows the distribution in (3) is symbolized by X ~ P H ( θ , α , β ) . The power Hamza distribution reduces to the Hamza distribution when β = 1 .

Proposition 2. For x > 0 , θ > 0 , α > 0 and β > 0 , the cdf of X ~ P H ( θ , α , β ) is given by

F P H ( x ) = 1 [ 1 + θ x β ( θ 5 x 5 β + 6 θ 4 x 4 β + 30 θ 3 x 3 β + 120 θ 2 x 2 β + 360 θ x β + 720 ) ( α θ 5 + 120 ) ] e θ x β (7)

Proof. [15] defines the cdf of a continuous random variable X as

F ( x ) = P ( X x ) = 0 x f ( u ) d u (8)

Substituting (3) into (8) leads to

F P H ( x ) = β θ 6 α θ 5 + 120 0 x ( α + θ 6 u 6 β ) u β 1 e θ u β d u (9)

Letting x = u 1 / β , u = x β , d u / d x = β x β 1 , the integral (9) becomes

F P H ( x ) = β θ 6 α θ 5 + 120 [ α θ 0 θ x β e x d x + 1 6 θ 6 0 θ x β x 6 e x d x ] (10)

Applying direct integration to the first part of the square bracket in (10) and integration by parts to the second part the square bracket in (10) gives

0 θ x β e x d x = 1 e θ x β (11)

and

0 θ x β x 6 e x d x = ( θ x β ) 6 e θ x β 6 ( θ x β ) 5 e θ x β 30 ( θ x β ) 4 e θ x β 120 ( θ x β ) 3 e θ x β 360 ( θ x β ) 2 e θ x β 720 θ x β e θ x β + 720 (12)

respectively. Substituting (11) and (12) into (10), we obtain

F P H ( x ) = β θ 6 α θ 5 + 120 [ α θ ( 1 e θ x β ) + 1 6 θ 6 ( ( θ x β ) 6 e θ x β 6 ( θ x β ) 5 e θ x β 30 ( θ x β ) 4 e θ x β 120 ( θ x β ) 3 e θ x β 360 ( θ x β ) 2 e θ x β 720 θ x β e θ x β + 720 ) ] (13)

Further simplification of (13) gives (7), which completes the proof of Proposition 2.

Proposition 3. Let X ~ P H ( θ , α , β ) , then the hazard rate function of X is given by

h P H ( x ) = β θ 6 ( α + θ 6 x 6 β ) x β 1 [ θ x β ( θ 5 x 5 β + 6 θ 4 x 4 β + 30 θ 3 x 3 β + 120 θ 2 x 2 β + 360 θ x β + 720 ) + 6 ( α θ 5 + 120 ) ] (14)

Proof. The proof of Proposition 3 follows from using (4) and (7) in the relation

h P H ( x ) = f P H ( x ) 1 F P H ( x ) (15)

Figure 1 and Figure 2 demonstrate the graphs of the pdf and hazard function of the PH distribution for different values of θ , α and β .

(a) (b)

Figure 1. (a) pdf plot of power hamza distribution; (b) pdf plot of power hamza distribution.

(a) (b)

Figure 2. (a) hazard plot of power hamza distribution; (b) hazard plot of power hamza distribution.

3. Properties of the Power Hamza (PH) Distribution

3.1. Moment Generating Function

Proposition 4. Let X ~ P H ( θ , α , β ) , then the moment generating function of X is given by

M X ( t ) = r = 0 6 α θ 5 Γ ( r / β + 1 ) + Γ ( r / β + 7 ) 6 ( α θ 5 + 120 ) r ! θ r / β ( t θ 1 / β ) r (16)

Proof. The moment generating function of X ~ P H ( θ , α , β ) is obtained as follows

M X ( t ) = E ( e t X ) = 0 e t x f P H ( x ) d x = β θ 6 α θ 5 + 120 0 e t x ( α + θ 6 x 6 β ) x β 1 e θ x β d x = β θ 6 α θ 5 + 120 r = 0 t r r ! [ α 0 x β + r 1 e θ x β d x + θ 6 0 x 7 β + r 1 e θ x β d x ] (17)

Letting x = y 1 / β , y = x β and d x = ( 1 / β ) y β 1 d y , (17) reduces to

M X ( t ) = β θ 6 α θ 5 + 120 r = 0 t r r ! [ α β 0 y r / β e θ y d y + θ 6 β 0 x r / β + 6 e θ y d y ] = θ 6 α θ 5 + 120 r = 0 t r r ! [ α Γ ( r / β + 1 ) θ r / β + 1 + θ Γ ( r / β + 7 ) 6 θ r / β + 7 ] (18)

Further simplification of (18) gives (16), which completes the proof of Proposition 4.

3.2. Non-Central Moment

Proposition 5. Let X ~ P H ( θ , α , β ) , then the rth non-moment of X is given by

μ r = 6 α θ 5 Γ ( r / β + 1 ) + Γ ( r / β + 7 ) 6 θ r / β ( α θ 5 + 120 ) ; r = 1 , 2 , 3 , (19)

Proof. The rth moment of X ~ P H ( θ , α , β ) is obtained as follows

μ r = E ( X r ) = 0 x r f P H ( x ) d x = β θ 6 α θ 5 + 120 0 x r ( α + θ 6 x 6 β ) x β 1 e θ x β d x = β θ 6 α θ 5 + 120 [ α 0 x β + r 1 e θ x β d x + θ 6 0 x 7 β + r 1 e θ x β d x ] (20)

Putting x = y 1 / β , y = x β and d x = ( 1 / β ) y β 1 d y into (20), yields

μ r = θ 6 α θ 5 + 120 [ α 0 y r β 1 e θ y d y + θ 6 0 y ( r β + 6 ) 1 e θ y d y ] = θ 6 α θ 5 + 120 [ α Γ ( r / β + 1 ) θ r / β + 1 + θ Γ ( r / β + 7 ) 6 θ r / β + 7 ] (21)

Simplifying (21) a little bit, we get (19), hence the proof of Proposition 5.

Corollary 2. The first four non-central moments of X ~ P H ( θ , α , β ) are

μ 1 = 6 α θ 5 Γ ( 1 / β + 1 ) + Γ ( 1 / β + 7 ) 6 θ 1 / β ( α θ 5 + 120 ) = μ (22)

μ 2 = 6 α θ 5 Γ ( 2 / β + 1 ) + Γ ( 2 / β + 7 ) 6 θ 2 / β ( α θ 5 + 120 ) (23)

μ 3 = 6 α θ 5 Γ ( 3 / β + 1 ) + Γ ( 3 / β + 7 ) 6 θ 3 / β ( α θ 5 + 120 ) (24)

μ 4 = 6 α θ 5 Γ ( 4 / β + 1 ) + Γ ( 4 / β + 7 ) 6 θ 4 / β ( α θ 5 + 120 ) (25)

Proof. The proof of (22)-(25) follows directly from Proposition 5 by substituting r = 1 , 2 , 3 and 4 into (19).

3.3. Variance

Apart from the non-central moments, variance of a distribution is always relevant for measuring the spread from the mean. So, we provide the variance of the PH distribution in Proposition 6.

Proposition 6. Let X ~ P H ( θ , α , β ) , then the variance of X is given by

σ 2 = 6 ( α θ 5 + 120 ) ( 6 α θ 5 Γ ( 2 / β + 1 ) + Γ ( 2 / β + 7 ) ) ( 6 α θ 5 Γ ( 1 / β + 1 ) + Γ ( 1 / β + 7 ) ) 2 36 θ 2 / β ( α θ 5 + 120 ) 2 (26)

Proof. The variance of a random variable X can be computed using the relation

σ 2 = μ 2 ( μ 1 ) 2 (27)

The proof of (26) follows directly from substituting (22) and (23) into (27)

σ 2 = 6 α θ 5 Γ ( 2 / β + 1 ) + Γ ( 2 / β + 7 ) 6 θ 2 / β ( α θ 5 + 120 ) ( 6 α θ 5 Γ ( 1 / β + 1 ) + Γ ( 1 / β + 7 ) 6 θ 1 / β ( α θ 5 + 120 ) ) 2 (28)

3.4. Coefficient of Variation and Index of Dispersion

The coefficient of variation is given by

c v = σ μ = [ 6 ( α θ 5 + 120 ) ( 6 α θ 5 Γ ( 2 / β + 1 ) + Γ ( 2 / β + 7 ) ) ( 6 α θ 5 Γ ( 1 / β + 1 ) + Γ ( 1 / β + 7 ) ) 2 36 θ 2 / β ( α θ 5 + 120 ) 2 ] 1 / 2 6 α θ 5 Γ ( 1 / β + 1 ) + Γ ( 1 / β + 7 ) 6 θ 1 / β ( α θ 5 + 120 ) = [ 6 ( α θ 5 + 120 ) ( 6 α θ 5 Γ ( 2 / β + 1 ) + Γ ( 2 / β + 7 ) ) ( 6 α θ 5 Γ ( 1 / β + 1 ) + Γ ( 1 / β + 7 ) ) 2 ] 1 / 2 6 α θ 5 Γ ( 1 / β + 1 ) + Γ ( 1 / β + 7 ) (29)

The index of dispersion is given by

γ = σ 2 μ = [ 6 ( α θ 5 + 120 ) ( 6 α θ 5 Γ ( 2 / β + 1 ) + Γ ( 2 / β + 7 ) ) ( 6 α θ 5 Γ ( 1 / β + 1 ) + Γ ( 1 / β + 7 ) ) 2 36 θ 2 / β ( α θ 5 + 120 ) 2 ] 6 α θ 5 Γ ( 1 / β + 1 ) + Γ ( 1 / β + 7 ) 6 θ 1 / β ( α θ 5 + 120 ) = 6 ( α θ 5 + 120 ) ( 6 α θ 5 Γ ( 2 / β + 1 ) + Γ ( 2 / β + 7 ) ) ( 6 α θ 5 Γ ( 1 / β + 1 ) + Γ ( 1 / β + 7 ) ) 2 6 θ 1 / β ( α θ 5 + 120 ) ( 6 α θ 5 Γ ( 1 / β + 1 ) + Γ ( 1 / β + 7 ) ) (30)

3.5. Central Moment

Proposition 7. Let X ~ P H ( θ , α , β ) , then the central moment of X is given by

μ r = i = 0 r ( r i ) ( 1 ) r i ( 6 α θ 5 Γ ( 1 / β + 1 ) + Γ ( 1 / β + 7 ) 6 θ 1 / β ( α θ 5 + 120 ) ) r i × ( 6 α θ 5 Γ ( i / β + 1 ) + Γ ( i / β + 7 ) 6 θ i / β ( α θ 5 + 120 ) ) (31)

Proof. The rth moment of the X ~ P H ( θ , α , β ) can be obtained from the relation

μ r = E ( X μ ) r = i = 0 r ( r i ) ( μ ) r i E ( X i ) (32)

Using (22) and (19) in (32) gives (31) and the proof of Proposition 7 is complete.

3.6. Conditional Moment

A function that is useful in deriving the mean residual life function of a component as well as the mean deviations is the conditional moment. Given that X follows a power Hamza distribution with parameters θ , α and β , then

E ( X n | X > x ) = 1 ( 1 F ( x ) ) J n ( x ) (33)

where

J n ( x ) = x t n f B E I ( t ) d t = β θ 6 α θ 5 + 120 x t n ( α + θ 6 t 6 β ) t β 1 e θ t β d t = β θ 6 α θ 5 + 120 [ α x t β + n 1 e θ t β d t + θ 6 x t 7 β + n 1 e θ t β d t ] (34)

Letting x = θ t β in (34) leads to

J n ( x ) = β θ 6 α θ 5 + 120 [ α β θ β / n + 1 θ x β x ( β / n + 1 ) 1 e x d x + θ 6 β θ β / n + 7 θ x β x ( β / n + 7 ) 1 e x d x ] = β θ 6 α θ 5 + 120 [ α β θ β / n + 1 Γ ( β n + 1 , θ x β ) + θ 6 β θ β / n + 7 Γ ( β n + 7 , θ x β ) ]

J n ( x ) = α θ 5 Γ ( β n + 1 , θ x β ) + Γ ( β n + 7 , θ x β ) 6 ( α θ 5 + 120 ) θ β / n (35)

where Γ ( a , z ) = x t a 1 e t d t is the complementary incomplete gamma function.

Also from (7), we get

( 1 F ( x ) ) = [ 1 + θ x β ( θ 5 x 5 β + 6 θ 4 x 4 β + 30 θ 3 x 3 β + 120 θ 2 x 2 β + 360 θ x β + 720 ) ( α θ 5 + 120 ) ] e θ x β (36)

Plugging (35) and (36) into (33), we have the conditional moment as

E ( X n | X > x ) = [ α θ 5 Γ ( β n + 1 , θ x β ) + Γ ( β n + 7 , θ x β ) ] e θ x β [ θ x β ( θ 5 x 5 β + 6 θ 4 x 4 β + 30 θ 3 x 3 β + 120 θ 2 x 2 β + 360 θ x β + 720 ) + 6 ( α θ 5 + 120 ) ] θ β / n (37)

3.7. Mean Residual Life Function

In many life testing experiments, it is always of interest to know the additional lifetime given that a component has survived until a certain amount of time. To achieve this purpose, the mean residual life function (MRL), which refers to the expected remaining life, X x , given that the item has survived up to time x, is required. It may be observed from (37), that the MRL function is derived from the conditional as follows

m X ( x ) = E ( X x | X > x ) x (38)

Putting n = 1 in (37) and substituting the result into (38), we obtain MRL function as

m X ( x ) = [ α θ 5 Γ ( β + 1 , θ x β ) + Γ ( β + 7 , θ x β ) ] e θ x β [ θ x β ( θ 5 x 5 β + 6 θ 4 x 4 β + 30 θ 3 x 3 β + 120 θ 2 x 2 β + 360 θ x β + 720 ) + 6 ( α θ 5 + 120 ) ] θ β x (39)

3.8. Mean Deviations

In statistical modelling, it is often an interest to measure the spread in a population from either the mean or the median. To achieve this, two indices, namely mean deviation about the mean ( μ d ) and mean deviation about the median ( M d ) are used. Let μ denote the mean and M, the median of a power Hamza distributed random variable X. The values of μ d and M d can be calculated using the relationships

μ d = 0 | x μ | f P H ( x ) d x = 2 μ F ( μ ) 2 μ + 2 J 1 ( μ ) (40)

and

M d = 0 | x M | f P H ( x ) d x = 2 J 1 ( M ) μ (41)

respectively. By replacing x with μ and M in (7) and (35), yields the following

μ d = 2 μ [ 1 ( 1 + θ μ β ( θ 5 μ 5 β + 6 θ 4 μ 4 β + 30 θ 3 μ 3 β + 120 θ 2 μ 2 β + 360 θ μ β + 720 ) ( α θ 5 + 120 ) ) e θ μ β ] 2 [ 6 μ ( α θ 5 + 120 ) θ β ( α θ 5 Γ ( β + 1 , θ μ β ) + Γ ( β + 7 , θ μ β ) ) ] 6 ( α θ 5 + 120 ) θ β (42)

M d = 2 [ α θ 5 Γ ( β + 1 , θ M β ) + Γ ( β + 7 , θ M β ) 3 μ ( α θ 5 + 120 ) θ β ] 3 ( α θ 5 + 120 ) θ β (43)

3.9. Bonferroni and Lorenz Curves

It has been found that the Bonferroni curve proposed by [16] and Lorenz curve proposed by [17] have applications in the fields of economics, reliability, demography, insurance, medicine, among others. So, if X ~ P H ( θ , α , β ) , the Bonferroni and Lorenz curves are respectively given by

B ( p ) = 1 p μ 0 q x f P H ( x ) d x (44)

and

L ( p ) = 1 μ 0 q x f P H ( x ) d x (45)

where μ = E ( X ) and q = F 1 ( p ) , 0 < p < 1 . Hence, for the power Hamza pdf (3), one gets

0 q x f P H ( x ) d x = β θ 6 α θ 5 + 120 0 q x ( α + θ 6 x 6 β ) x β 1 e θ x β d x = β θ 6 α θ 5 + 120 [ α 0 q x β e θ x β d x + θ 6 0 q x 7 β e θ x β d x ] (46)

Letting w = θ x β , x = ( w / θ ) 1 / β , d x = ( 1 / β ) ( 1 / θ ) 1 / β w 1 / β 1 d w and noting that x ( 0 , q ) implies w ( 0 , θ q β ) , (1) becomes

0 q x f P H ( x ) d x = θ 6 α θ 5 + 120 [ α ( 1 θ ) ( β + 1 β ) 0 θ q β w ( β + 1 β ) 1 e w d w + θ 6 ( 1 θ ) ( 7 β + 1 β ) 0 θ q β w ( 7 β + 1 β ) 1 e w d w ] = θ 6 α θ 5 + 120 [ α ( 1 θ ) ( β + 1 β ) Γ ( β + 1 β , θ q β ) + θ 6 ( 1 θ ) ( 7 β + 1 β ) Γ ( 7 β + 1 β , θ q β ) ] = θ 6 ( 1 θ ) β 2 + 1 β [ 6 α θ 6 β Γ ( β 2 + 1 β , θ q β ) + θ Γ ( 7 β 2 + 1 β , θ q β ) ] 6 θ 6 β ( α θ 5 + 120 ) (47)

Substituting (46) and (47) into (44) and (45), we obtain the Bonferroni and Lorenz curves, respectively, for the power Hamza distribution as

B ( p ) = θ 6 ( 1 θ ) β 2 + 1 β [ 6 α θ 6 β Γ ( β 2 + 1 β , θ q β ) + θ Γ ( 7 β 2 + 1 β , θ q β ) ] 6 p θ 6 β μ ( α θ 5 + 120 ) (48)

L ( p ) = θ 6 ( 1 θ ) β 2 + 1 β [ 6 α θ 6 β Γ ( β 2 + 1 β , θ q β ) + θ Γ ( 7 β 2 + 1 β , θ q β ) ] 6 θ 6 β M ( α θ 5 + 120 ) (49)

3.10. Stochastic Ordering

In this section, we discuss the comparative behaviour of the power Hamza random variable using the stochastic ordering. In line with [18], a power Hamza random variable X is said to be smaller than another random variable Y in the 1) stochastic order ( X Y s t ) if F X ( x ) F Y ( x ) x , 2) hazard rate order ( X Y h r ) if h X ( x ) h Y ( x ) x , 3) mean residual life order ( X Y m r l ) if m X ( x ) m Y ( x ) x and 4) likelihood ratio order ( X Y l r ) if f X ( x ) / f Y ( x ) a decreasing function of x. To show the flexibility of the power Hamza distribution, we present the following Proposition.

Proposition 8. Let X ~ ( θ 1 , α 1 , β 1 ) and Y ~ ( θ 2 , α 2 , β 2 ) be two independent random variables. If 1) θ 1 > θ 2 , α 1 = α 2 and β 1 = β 2 ; 2) α 1 > α 2 , θ 1 = θ 2 , β 1 = β 2 ; 3) β 1 < β 2 , θ 1 = θ 2 , α 1 = α 2 and 4) θ 1 > θ 2 , α 2 < α 1 , β 1 < β 2 , then X l r Y , X h r Y , X m l r Y and X s t Y .

Proof. The likelihood ratio is

f X ( x ; θ 1 , α 1 , β 1 ) f Y ( x ; θ 2 , α 2 , β 2 ) = β 1 θ 1 6 ( α 2 θ 2 5 + 120 ) β 2 θ 2 6 ( α 1 θ 1 5 + 120 ) x β 1 β 2 ( 6 α 1 + θ 1 x 6 β 1 6 α 2 + θ 2 x 6 β 2 ) e ( θ 1 x β 1 θ 2 x β 2 ) (50)

The log-likelihood ratio is

ln { f X ( x ; θ 1 , α 1 , β 1 ) f Y ( x ; θ 2 , α 2 , β 2 ) } = ln [ β 1 θ 1 6 ( α 2 θ 2 5 + 120 ) β 2 θ 2 6 ( α 1 θ 1 5 + 120 ) ] + ( β 1 β 2 ) ln x + ln ( 6 α 1 + θ 1 x 6 β 1 6 α 2 + θ 2 x 6 β 2 ) ( θ 1 x β 1 θ 2 x β 2 ) (51)

Differentiating the log-likelihood with respect to x, we get

d d x ln { f X ( x ; θ 1 , α 1 , β 1 ) f Y ( x ; θ 2 , α 2 , β 2 ) } = 36 ( 6 θ 1 α 2 β 1 x 6 β 1 1 + θ 2 α 1 β 2 x 6 β 2 1 ) + 6 θ 1 θ 2 ( β 1 β 2 ) x 6 ( β 1 β 2 ) 1 ( 6 α 1 + θ 1 x 6 β 1 ) ( 6 α 2 + θ 2 x 6 β 2 ) + ( β 1 β 2 ) x ( θ 1 β 1 x β 1 1 θ 2 β 2 x β 2 1 ) (52)

Since d d x ln { f X ( x ; θ 1 , α 1 , β 1 ) f X ( x ; θ 1 , α 1 , β 1 ) } < 0 for conditions 1), 2), 3) and 4), then X l r Y and hence, X h r Y , X m l r Y and X s t Y , which completes the proof of Proposition 8.

3.11. Rényi Entropy

To quantify the amount of information (such as the diversity, uncertainty, or randomness) contained in a random sample drawn from a population, the entropy is utilized. It may be noted that a large value of entropy indicates that the data contains greater uncertainty. Several studies have applied entropy in the fields of physics, probability and statistics, communication theory, economics, among others. Therefore, we derive one of the commonly used entropies, namely the R nyi entropy. For ρ > 0 and ρ 1 , the Rényi entropy due to [19] is defined for a continuous random variable as

E ρ = 1 1 ρ log ( 0 f P H ρ ( x ) d x ) (53)

Using the pdf (3) in (53), we obtain

E ρ = 1 1 ρ log [ 0 ( β θ 6 α θ 5 + 120 ( α + θ 6 x i 6 β ) x i β 1 e θ x i β ) ρ d x ] = 1 1 ρ log [ ( β θ 6 α θ 5 + 120 ) ρ 0 ( α + θ 6 x 6 β ) ρ x ρ ( β 1 ) e θ ρ x β d x ] = 1 1 ρ log [ j = 1 ρ ( ρ j ) θ 6 ρ + j α ρ j β ρ 6 j ( α θ 5 + 120 ) ρ 0 x 6 β + ρ ( β 1 ) e θ ρ x β d x ]

= 1 1 ρ log [ j = 1 ρ ( ρ j ) 6 j θ 6 ρ + j α ρ j β ρ 1 ( α θ 5 + 120 ) ρ 0 y ( 6 j + ρ 1 / ρ + 1 / β ) 1 e θ ρ y d y ] y = x β

E ρ = 1 1 ρ log [ j = 1 ρ ( ρ j ) 6 j θ 6 ρ + j α ρ j β ρ 1 ( α θ 5 + 120 ) ρ Γ ( 6 j + ρ 1 / ρ + 1 / β ) ( θ ρ ) 6 j + ρ 1 / ρ + 1 / β ] (54)

3.12. Distribution of Order Statistics for the PH Distribution

Suppose X ( 1 ) < X ( 2 ) < < X ( n ) constitutes the order statistics for a random sample X 1 , X 2 , , X n drawn from the power Hamza distribution with pdf (3) and cdf (7). Then the pdf of the rth order statistic X ( r ) can be written as

f X ( r ) ( x ) = n ! ( r 1 ) ! ( n r ) ! f P H ( x ) ( F P H ( x ) ) r 1 ( 1 F P H ( x ) ) n r ; r = 1 , 2 , , n = β θ 6 n ! ( α θ 5 + 120 ) ( r 1 ) ! ( n r ) ! u = 0 n r ( 1 ) u ( n r u ) x β 1 e θ x β ( α + θ 6 x 6 β ) × [ 1 ( 1 + θ x β ( θ 5 x 5 β + 6 θ 4 x 4 β + 30 θ 3 x 3 β + 120 θ 2 x 2 β + 360 θ x β + 720 ) ( α θ 5 + 120 ) ) e θ x β ] r + u 1 (55)

Putting r = 1 into (55) gives the pdf of the first order statistic X ( 1 ) as

f X ( 1 ) ( x ) = n β θ 6 α θ 5 + 120 x β 1 e θ x β ( α + θ 6 x 6 β ) × [ ( 1 + θ x β ( θ 5 x 5 β + 6 θ 4 x 4 β + 30 θ 3 x 3 β + 120 θ 2 x 2 β + 360 θ x β + 720 ) ( α θ 5 + 120 ) ) e θ x β ] n 1 (56)

Putting r = n into (55) gives the pdf of the nth order statistic X ( n ) as

f X ( n ) ( x ) = n β θ 6 α θ 5 + 120 x β 1 e θ x β ( α + θ 6 x 6 β ) × ( 1 [ 1 + θ x β ( θ 5 x 5 β + 6 θ 4 x 4 β + 30 θ 3 x 3 β + 120 θ 2 x 2 β + 360 θ x β + 720 ) ( α θ 5 + 120 ) ] e θ x β ) n 1 (57)

4. Maximum Likelihood Estimators of the Power Hamza Distribution

Let X 1 , X 2 , , X n denote a random sample of size n from the PH distribution having parameters θ , α and β . To estimate the parameters θ , α and β using the maximum likelihood method, we define the likelihood function of the random sample form the PH distribution as

L ( θ , α , β ) = i = 1 n f ( x i ; θ , α , β ) = ( β θ 6 α θ 5 + 120 ) n e θ i = 1 n x i β i = 1 n ( α + θ 6 x i 6 β ) x i β 1 (58)

Taking the natural log of (58), we obtain the log-likelihood function as

ln L ( θ , α , β ) = n [ ln ( β ) + 6 ln ( θ ) ln ( α θ 5 + 120 ) ] + i = 1 n ln ( α + θ 6 x i 6 β ) + ( β 1 ) i = 1 n x i θ i = 1 n x i β (59)

Differentiating (59) with respect to θ , α and β respectively and equating the resulting derivatives to zero, one obtains

ln L ( θ , α , β ) θ = 6 n θ 5 n α θ 4 α θ 5 + 120 + i = 1 n x i 6 β 6 α + θ x i 6 β i = 1 n x i β = 0 (60)

ln L ( θ , α , β ) α = n θ 5 α θ 5 + 120 + i = 1 n 6 6 α + θ x i 6 β = 0 (61)

ln L ( θ , α , β ) β = n β + 6 θ i = 1 n x i 6 β ln ( x i ) 6 α + θ x i 6 β + i = 1 n ln ( x i ) θ i = 1 n x i β ln ( x i ) = 0 (62)

The above non-linear systems of equations are solved by numerical iteration technique and maximum likelihood estimates are obtained. Since the maximum likelihood estimates for θ , α and β are not in closed form we use the large sample behaviour of maximum likelihood estimators to obtain the confidence intervals for model parameters.

5. Asymptotic Confidence Intervals of the Power Hamza Distribution

In this section, we present the asymptotic confidence intervals for the parameters of the PH distribution. Let ψ ^ = ( θ ^ , α ^ , β ^ ) T be the maximum likelihood estimate of ψ = ( θ , α , β ) T . Under the conditions that the parameters are in the interior of the parameter space, but not on the boundary, the asymptotic distribution of n ( ψ ^ ψ ) is N 3 ( 0 , I 1 ( ψ ) ) , where I ( ψ ) is the expected Fisher information matrix. The asymptotic behaviour of the expected information matrix can be approximated by the observed information matrix, denoted by I n ( ψ ^ ) . The observed information matrix of the power Hamza distribution is given by

I n ( ψ ^ ) = ( 2 ln L ( θ , α , β ) θ 2 2 ln L ( θ , α , β ) θ ^ α ^ 2 ln L ( θ , α , β ) θ ^ β ^ 2 ln L ( θ , α , β ) α ^ θ ^ 2 ln L ( θ , α , β ) α ^ 2 2 ln L ( θ , α , β ) α ^ β ^ 2 ln L ( θ , α , β ) β ^ θ ^ 2 ln L ( θ , α , β ) β ^ α ^ 2 ln L ( θ , α , β ) β ^ 2 ) (63)

Thus,

I n 1 ( ψ ^ ) = ( n I ( ψ ^ ) ) 1 = ( var ( θ ^ ) cov ( θ ^ , α ^ ) cov ( θ ^ , β ^ ) cov ( α ^ , θ ^ ) var ( α ^ ) cov ( α ^ , β ^ ) cov ( β ^ , θ ^ ) cov ( β ^ , α ^ ) var ( β ^ ) ) (64)

Taking the second order derivatives of (59) with respect to θ , α and β are, respectively, we obtain the entries of (63) as follows

2 ln L ( θ , α , β ) θ 2 = 6 n θ + 5 n α θ 3 ( α θ 5 480 ) ( α θ 5 + 120 ) 2 i = 1 n x i 12 β ( 6 α + θ x i 6 β ) 2 (65)

2 ln L ( θ , α , β ) α 2 = n θ 10 ( α θ 5 + 120 ) 2 i = 1 n 36 ( 6 α + θ x i 6 β ) 2 (66)

2 ln L ( θ , α , β ) β 2 = n β 2 + 216 α θ i = 1 n x i 6 β ln 2 ( x i ) ( 6 α + θ x i 6 β ) 2 θ i = 1 n x i β ln 2 ( x i ) (67)

2 ln L ( θ , α , β ) α θ = 600 n θ 4 ( α θ 5 + 120 ) 2 6 i = 1 n x i 6 β ( 6 α + θ x i 6 β ) 2 (68)

2 ln L ( θ , α , β ) β θ = 36 α i = 1 n x i 6 β ln ( x i ) ( 6 α + θ x i 6 β ) 2 i = 1 n x i β ln ( x i ) (69)

2 ln L ( θ , α , β ) α β = 36 i = 1 n x i 6 β ln ( x i ) ( 6 α + θ x i 6 β ) 2 (70)

The expectations in the Fisher information matrix can be obtained numerically. The multivariate normal distribution with mean vector ( 0 , 0 , 0 ) T and covariance matrix I 1 ( ψ ) can be used to construct confidence intervals for the model parameters. The approximate 100 ( 1 η ) % two-sided confidence intervals for θ , α and β are determined by

θ ^ ± Z η / 2 var ( θ ^ ) , α ^ ± Z η / 2 var ( α ^ ) and β ^ ± Z η / 2 var ( β ^ ) (71)

respectively, where Z η / 2 is the upper ( η / 2 ) th percentile of a standard normal distribution.

6. Applications

In this section, we provide an application to real data set to demonstrate the importance and flexibility of the PH distribution.

The data set is on the breaking strength of carbon fibres of 50 mm length (GPa). The data has been previously used by [20] and [21]. The data is as follows:

0.39, 0.85, 1.08, 1.25, 1.47, 1.57, 1.61, 1.61, 1.69, 1.80, 1.84, 1.87, 1.89, 2.03, 2.03, 2.05, 2.12, 2.35, 2.41, 2.43, 2.48, 2.50, 2.53, 2.55, 2.55, 2.56, 2.59, 2.67, 2.73, 2.74, 2.79, 2.81, 2.82, 2.85, 2.87, 2.88, 2.93, 2.95, 2.96, 2.97, 3.09, 3.11, 3.11, 3.15, 3.15, 3.19, 3.22, 3.22, 3.27, 3.28, 3.31, 3.31, 3.33, 3.39, 3.39, 3.56, 3.60, 3.65, 3.68, 3.70, 3.75, 4.20, 4.38, 4.42, 4.70, 4.90.

We fitted the PH distribution to the data set by using the method of maximum likelihood and the results are compared with five other competitive lifetime distributions namely,

1) Hamza distribution (HD) defined in Equation (2),

2) Weighted Weibull distribution (WWD):

f ( x ; θ , β ) = θ β c + 1 x β ( c + 1 ) 1 e θ x β Γ ( c + 1 ) ; x > 0 , θ > 0 , β > 0 , c > 1 ,

3) Two-Parameter Weibull distribution (TPWD):

f ( x ; θ , β ) = θ β x β 1 e θ x β ; x > 0 , θ > 0 , β > 0 ,

4) Pareto distribution (PD):

f ( x ; α ) = α x m α x m α + 1 ; x > 0 , α > 0 ,

5) Exponential distribution (ED):

f ( x ; θ ) = θ e θ x ; x > 0 , θ > 0 .

We used the goodness-of-fit test based on the Kolmogorov-Smirnov test due to ( [22] [23] [24] [25] ) with its corresponding p-value to verify that the data set under consideration actually follow the proposed distribution. The computational formula for this goodness-of-fit test is given by

KS = max { i n F ^ ( x ( i ) ) , F ^ ( x ( i ) ) i 1 n } (72)

where F ^ ( x i ) is the estimated distribution function under the ordered data. Since there is more than one distribution to be compared, the distribution with the largest KS p-value will be more appropriate to fit the given sample.

We shall also determine the appropriate model from among all models compared for the real data set by considering three discrimination criteria, based on the log-likelihood function evaluated at the maximum likelihood estimates, the Akaike information criterion (AIC) due to [26] and the Bayesian information criterion (BIC) due to [27], respectively. To compute the AIC and BIC, the following formulae are used

Table 1. Parameter estimates, standard errors, log-likelihood values and goodness-of-fit measures.

AIC = 2 l + 2 k (73)

BIC = 2 l + k log ( n ) (74)

where l denotes the log-likelihood function evaluated at the maximum likelihood estimates, k is the number of parameters in the statistical model and n is the sample size of the fitted data respectively. All the computations for (72)-(74) were performed using R software. Generally, for the given data-sets, we consider a distribution to be best among all competing distributions if it has smallest AIC value, the smallest BIC value, the smallest log-likelihood value and the largest p-value.

As shown in Table 1, the PH distribution has the largest KS p-value and smallest AIC, BIC and log-likelihood values as compared to other fitted distributions. Hence, the PH distribution is better than the other distributions in Table 1 for fitting the data under consideration.

7. Conclusion

This study introduced a new distribution, called the power Hamza distribution using power transformation method. The contribution of this paper has to do with addition of skewness to the Hamza distribution, which depends only on one parameter. The density function of the power Hamza distribution can take various forms depending on its shape parameter. The hazard rate function of the power Hamza distribution exhibits heavy-tailed shape, upside-down bathtub shape and J-shape, which implies that the distribution can be used for analyzing lifetime and survival time datasets. A detailed discussion of the properties of the proposed distribution has been given. Estimates of the three unknown parameters of the PH distribution are obtained using the maximum likelihood estimation method. The PH distribution was fitted to a real dataset and compared to five distributions and the results showed that the proposed distribution outperformed all of them in modelling the data under consideration.

Acknowledgements

The authors are thankful to the editor and reviewers of this article for providing very useful comments which led to the improvement of this paper.

Authors’ Contribution

All authors contributed equally in developing the article.

Funding Statement

This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors.

Conflicts of Interest

The authors declare no conflicts of interest regarding the publication of this paper.

References

[1] Aijaz, A., Jallal, M., Ain Ul, S.Q. and Tripathi, R. (2020) The Hamza Distribution with Statistical Properties and Applications. Asian Journal of Probability and Statistics, 8, 28-42.
https://doi.org/10.9734/ajpas/2020/v8i130198
[2] Lindley, D.V. (1958) Fiducial Distributions and Bayes. Journal of the Royal Statistical Society, Series B, 20, 102-107.
https://doi.org/10.1111/j.2517-6161.1958.tb00278.x
[3] Shanker, R. and Shukla, K.K. (2017) Ishita Distribution and Its Applications. Biometrics & Biostatistics International Journal, 5, 1-9.
https://doi.org/10.15406/bbij.2017.05.00126
[4] Shukla, K.K. (2018) Pranav Distribution with Properties and Its Applications. Biometrics and Biostatistics International Journal, 7, 244-254.
https://doi.org/10.15406/bbij.2018.07.00215
[5] Ghitany, M.E., Al-Mutairi, D.K., Balakrishnan, N. and Al-Enezi, L.J. (2013) Power Lindley Distribution and Associated Inference. Computational Statistics and Data Analysis, 64, 20-33.
https://doi.org/10.1016/j.csda.2013.02.026
[6] Shanker, R. and Shukla, K.K. (2017) Power Akash Distribution and Its Applications. Journal of Applied Quantitative Methods, 12, 1-1012.
[7] Shanker, R. and Shukla, K.K. (2017) Power Shanker Distribution and Its Application. Turkiye Klinikleri Journal of Biostatistics, 9, 175-187.
https://doi.org/10.5336/biostatic.2017-56601
[8] Shukla, K.K. and Shanker, R. (2018) Power Ishita Distribution and Its Applications to Model Lifetime Data. Statistics in Transition New Series, 19, 135-148.
https://doi.org/10.21307/stattrans-2018-008
[9] Shanker, R. and Shukla, K.K. (2018) A Two-Parameter Power Aradhana Distribution with Properties and Application. Indian Journal of Industrial and Applied Mathematics, 9, 210-220.
https://doi.org/10.5958/1945-919X.2018.00016.6
[10] Abebe, B., Tesfay, M., Eyob, T. and Shanker, R. (2019) A Two-Parameter Power Rama Distribution with Properties and Applications. Biometrics and Biostatistics International Journal, 8, 6-11.
https://doi.org/10.15406/bbij.2019.08.00262
[11] Abebe, B., Tesfay, M., Eyob, T. and Shanker, R. (2019) A Two-Parameter Power Garima Distribution with Properties and Applications. Annals of Biostatistics and Biometric Applications, 1, 1-7.
https://doi.org/10.33552/ABBA.2019.01.000515
[12] Shukla, K.K. (2019) Power Pranav Distribution and Its Applications. Journal of Applied Quantitative Methods, 14, 1-13.
[13] Shanker, R. and Shukla, K.K. (2019) A Two-Parameter Power Sujatha Distribution with Properties and Application. International Journal of Mathematics and Statistics, 20, 11-22.
[14] Shukla, K.K. and Shanker, R. (2020) Power Prakaamy Distribution and Its Applications. International Journal of Computations and Theoretical Statistics, 7, 26-36.
[15] Hogg, R.V., McKean, J.W. and Craig, A.T. (2019) Introduction to Mathematical Statistics. 8th Edition, Pearson Education, Inc., London.
[16] Bonferroni, C.E. (1930) Elementi di Statistica Generale, Seeber, Firenze.
[17] Lorenz, M.O. (1905) Methods of Measuring the Concentration of Wealth. Publications of the American Statistical Association, 9, 209-219.
https://doi.org/10.2307/2276207
[18] Shaked, M. and Shanthikumar, J.G. (1994) Stochastic Orders and Their Applications. Academic Press, Boston.
[19] Rényi, A. (1961) On Measure of Entropy and Information. In: Proceedings of the 4th Berkeley Symposium on Mathematical Statistics and Probability, Vol. 1, University of California Press, Berkeley, 547-561.
[20] Cordeiro, G.M. and Lemonte, A.J. (2011) The β-Birnbaum-Saunders Distribution: An Improved Distribution for Fatigue Life Modelling. Computational Statistics and Data Analysis, 55, 1445-1461.
https://doi.org/10.1016/j.csda.2010.10.007
[21] Al-Aqtash, R., Lee, C. and Famoye, F. (2014) Gumbel-Weibull Distribution: Properties and Applications. Journal of Modern Applied Statistical Methods, 13, 201-225.
https://doi.org/10.22237/jmasm/1414815000
[22] Kolmogorov, A. (1933) Sulla determinazione empirica di una lgge di distribuzione. Giornale dell’Istituto Italiano degli Attuari, 4, 83-91.
[23] Smirnoff, N. (1939) Sur les écarts de la courbe de distribution empirique. Matematicheskii Sbornik, 48, 3-26.
[24] Scheffé, H. (1943) Statistical Inference in the Non-Parametric Case. The Annals of Mathematical Statistics, 14, 305-332.
https://doi.org/10.1214/aoms/1177731355
[25] Wolfowitz, J. (1949) Non-Parametric Statistical Inference. In: Proceedings of the Berkeley Symposium on Mathematical Statistics and Probability, University of California Press, Berkeley, 93-113.
[26] Akaike, H. (1974) A New Look at the Statistical Model Identification. IEEE Transactions on Automatic Control, 19, 716-723.
https://doi.org/10.1109/TAC.1974.1100705
[27] Schwarz, G. (1978) Estimating the Dimension of a Model. Annals of Statistics, 6, 461-464.
https://doi.org/10.1214/aos/1176344136

Copyright © 2024 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.