Mean Difference and Mean Deviation of Tukey Lambda Distribution ()
1. Introduction
The purpose of this work is to increase the methodological contributions on the mean difference and on the relationships of the mean difference with other variability indexes [1] [2]. The studies on the mean difference, introduced by Corrado Gini in 1912 as a measure of the variability of the characters according to the aspect of inequality, have aroused the interest of many scholars over years and also recently [3] [4]. The importance of mean difference is also due to the fact that the sample mean difference is a correct estimate of that of the population distribution model and, therefore, functional for inferential purposes [5]. The theoretical contributions on the mean difference concern the main continuous distribution models (normal, rectangular, exponential, ...) [6], however, for other distribution models, such as Tukey’s, no contributions are known in literature.
2. Tukey Lambda Distribution
Tukey lambda distribution is usually used to choose a distribution model to fit data and its direct use is less usual. In general, its characteristic is that neither its density function
nor its cumulative function
is known, but only the inverse of this latter
, that is the quantile function Q(p) [7] [8].
A complete Tukey distribution shape includes three parameters: one of position, one of scale and one of shape [9] [10].
In order to calculate the mean difference and the mean deviation, it is better to refer to a reduced distribution in which the position parameter is set to zero and the scale to one. Formulas of mean difference and mean deviation of complete distribution are equal to the ones of reduced distribution multiplied by the scale parameter value. Tukey lambda distribution is defined by the quantile function
(1)
Said function is not always analytically invertible and, therefore, allows to obtain cumulative function and density function only for some values of λ [11] which are
. Cumulative functions of Tukey lambda distribution for such values are listed below:
(2)
(3)
(4)
(5)
(6)
(7)
(8)
(9)
(10)
(11)
It is necessary to use numerical inversion of
to get a cumulative function for other λ values.
Regarding Tukey distribution, some characteristic values as function of λ are known: average, mode, median, standard deviation, asymmetry index, disnormality excess index, entropy, characteristic function. Expressions of mean difference and mean deviation are unknown.
3. Variability Indexes of Tukey Lambda Distribution
The variance of Tukey lambda distribution as a function of λ parameter [12] is
(12)
By using the cumulative functions derived by the inversion of quantile functions of Tukey lambda distribution, mean difference and mean deviation values are obtained and shown in Table 1.
Mean difference values for integers from 1 to 10 are arranged exactly on a parabolic hyperbola
(13)
Some values of Δ calculated numerically for other values of λ parameter are also all arranged over the said function, which can be then considered a general expression of the mean difference of Tukey lambda distribution. Said function takes not-negative finite values for
, as it can be shown in Figure 1.
Therefore, the mean difference in Tukey lambda distribution has a domain
which is wider than the one of standard deviation
.
Let us now consider the mean deviation. First of all, we can see that the average of our distribution exists only for
and, therefore, said domain also applies to mean deviation. Mean deviation values for integers from 1 to 10 are arranged exactly over the function
(14)
Values of δ calculated numerically for other values of λ parameter are also all arranged exactly over the said function, which can be then considered the expression
Figure 1. Mean difference of Tukey lambda distribution as a function of λ parameter.
Figure 2. Mean deviation of Tukey lambda distribution as a function of λ parameter.
Table 1. Values of mean difference and mean deviation for some values of λ parameter in Tukey lambda distribution.
of mean deviation of the Tukey lambda distribution. Said function takes not-negative finite values for
as it can be shown in Figure 2.
The mean deviation of Tukey lambda distribution has, therefore, a domain wider than the one of standard deviation.
4. Relations between Variability Indexes of Tukey Lambda Distribution
By inverting the expression of mean difference in Tukey lambda distribution as a function of λ parameter (13), the following two roots come out
(15)
and
(16)
The second solution, which is always negative, is not usable to obtain the relationship between ∆ and σ [13].
By substituting the first solution
(15) in the standard deviation expression, it comes out an analytical relationship of the same one related to the mean difference of Tukey lambda distribution:
(17)
Said relationship is represented in Figure 3.
As it can be seen, standard deviation increases quickly when mean difference increases.
Let us, now, consider the relationship between mean difference and mean deviation.
By substituting root
in the formula of mean deviation (14), it comes out the following analytical relationship
(18)
As shown in Figure 4, it is evident that the relationship between the two indexes is almost linear.
Finally, let us consider the relationship between mean deviation and standard deviation of Tukey lambda distribution.
Since it is not possible to obtain
parameter as a function of mean deviation, it is necessary to use a numerical procedure to calculate the two variability indexes values for a consistent set of λ parameter values and to represent pairs of values on a Cartesian axis.
By choosing values of λ: −0.49, −0.48, ..., 5.00, it comes out a numerical relationship as shown in Figure 5.
Figure 3. Analytical relationship between mean difference and standard deviation of Tukey lambda distribution.
Figure 4. Analytical relationship between mean difference and mean deviation of Tukey lambda distribution.
Figure 5. Numerical relationship between mean deviation and standard deviation of Tukey lambda distribution.
As it can be seen, the relationship between mean deviation and standard deviation of Tukey Lambda distribution increases with slow acceleration.
5. Conclusive Remarks
In this work, the formulas of mean difference and mean deviation of Tukey Lambda distribution have been obtained. It is an original contribution aimed at increasing the knowledge about this distribution model. These results allowed us to investigate the relationships among the three main variability indexes, standard deviation, mean deviation and mean difference, regarding Tukey lambda model.
Attributions
Girone Section 1; Massari Section 3; Manca Section 4; D’Uggento Sections 2 and 5.