On the Mean Difference Variance in Random Samples of Student’s Variables

Abstract

The purpose of this paper is to obtain the expression of the sample mean difference variance of the Student’s distributive model. In the 2007 the study of the mean difference variance, after some decades, was resumed by Campobasso [1]. Using the Nair’s [2] and Lomnicki’s general results [3], he obtained the variance of sample mean difference for different distributive models (Laplaces, triangular, power, logit, Paretos and Gumbel’s model). In addition he extended the knowledge comparing to the ones already known for the other distributive model (normal, rectangular and exponential model).

Share and Cite:

Fabio, M. and Claudia, M. (2020) On the Mean Difference Variance in Random Samples of Student’s Variables. Open Journal of Statistics, 10, 659-663. doi: 10.4236/ojs.2020.104040.

1. The Mean Difference Variance General Expression

Let X be a continuous random variable with density function f(x) and distribution function F(x). Then, let X 1 , X 2 , , X n be a simple random sample from such population; the sample mean difference is

Δ ¯ = i = 1 n j = 1 n | X i X j | n ( n 1 ) (1)

The mean value of Δ ¯ is equal to the mean difference of the population

Δ = + + | x y | f ( x ) f ( y ) d x d y (2)

In 1952 Lomnicki [3] obtained the following general expression of the sample mean difference variance

var ( Δ ¯ ) = 1 n ( n 1 ) [ 4 ( n 1 ) σ 2 + 16 ( n 2 ) I 2 ( 2 n 3 ) Δ 2 ] (3)

in which σ 2 and Δ are the variance and the mean difference of the considered distributive model respectively, whereas

I = + G ( x ) H ( x ) f ( x ) d x (4)

in which

G ( x ) = x ( x y ) f ( y ) d y (5)

and

H ( x ) = G ( x ) + μ x , (6)

in which μ is the mean value of such distributive model.

The mean value μ and the variance are known for almost all the distributive models. Concerning the mean difference Δ , the known results are collected in Girone’s and Mazzitelli’s paper [4].

So, to determine the expression of the sample mean difference variance it’s only needed the calculation of I.

2. I Expression for Odd g

The density function of the Student’s distributive model is

f ( x ) = ( 1 + x 2 / 2 ) g + 1 2 g B ( g / 2 , g / 2 ) , < x < + (7)

in which the parameter g is called number of degrees of freedom.

Using Mathematica software for such model for g = 5 , 7 , , 19 the obtained values of I g are shown in Table 1.

The second term is easily represented by the following formula

Table 1. The obtained values of I g for g = 5 , 7 , , 19 .

g 6 ( g 2 ) . (8)

The first term expression is more complicated to be determined. After several attempts comparing each first term to the previous one, we pointed out the recurring formula:

A g = A g 2 g ( g 4 ) ( g 3 ) 2 ( 3 g 4 ) ( 3 g 8 ) ( g 2 ) 4 ( 3 g 7 ) ( 3 g 11 ) , per g = 5 , 7 , (9)

with the initial value A 3 = 15 / ( 2 π 2 ) .

Considering the previous relation, we came to the following expression of the first I g term for odd g values greater than 3:

A g = 3 g Γ [ ( g 1 ) / 2 ] 2 Γ ( g / 2 1 / 3 ) Γ ( g / 2 + 1 / 3 ) 2 ( g 2 ) Γ ( g / 2 ) 2 Γ ( g / 2 1 / 6 ) Γ ( g / 2 5 / 6 ) π , (10)

and then the I g expression

I g = 3 g Γ [ ( g 1 ) / 2 ] 2 Γ ( g / 2 1 / 3 ) Γ ( g / 2 + 1 / 3 ) 2 ( g 2 ) Γ ( g / 2 ) 2 Γ ( g / 2 1 / 6 ) Γ ( g / 2 5 / 6 ) π g 6 ( g 2 ) (11)

3. I Expression for Even g

Using again Mathematica software for g = 4 , 6 , , 2 0 the obtained values of I g are shown in Table 2.

The second term of I g is represented again by the simple formula

g 6 ( g 2 ) . (12)

After several attempts, comparing each first term to the previous one, we pointed out the recurring formula:

A g = A g 2 g ( g 4 ) ( g 3 ) 2 ( 3 g 4 ) ( 3 g 8 ) ( g 2 ) 4 ( 3 g 7 ) ( 3 g 11 ) , per g = 6 , 8 , (13)

Table 2. The obtained values of I g for g = 4 , 6 , , 2 0 .

with the initial value A 4 = 8 / 15 . It has to be noticed that the recurring formula is the same one as the odd case.

Considering the previous relation, we came to the following expression of the first I g term for even g values greater than 4:

A g = 2 3 2 g 3 g ( g 2 ) Γ ( g 2 ) 2 Γ ( g / 2 1 / 3 ) Γ ( g / 2 + 1 / 3 ) Γ ( g / 2 ) 4 Γ ( g / 2 1 / 6 ) Γ ( g / 2 5 / 6 ) (14)

and then the I g expression

I g = 2 3 2 g 3 g ( g 2 ) Γ ( g 2 ) 2 Γ ( g / 2 1 / 3 ) Γ ( g / 2 + 1 / 3 ) Γ ( g / 2 ) 4 Γ ( g / 2 1 / 6 ) Γ ( g / 2 5 / 6 ) g 6 ( g 2 ) (15)

Through some algebraic steps it is easily verified that the two I g formulas for the odd case and the even one are the same and, moreover, a single more compact expression is the following

I g = 2 3 2 g 3 g Γ ( g / 2 1 / 3 ) Γ ( g / 2 + 1 / 3 ) ( g 1 ) 2 ( g 2 ) B ( g / 2 , g / 2 ) Γ ( g / 2 1 / 6 ) Γ ( g / 2 5 / 6 ) g 6 ( g 2 ) (16)

4. The Sample Mean Difference Variance

Let us remind that for the Student’s distributive model the expressions of the mean value ( μ ), the variance ( σ 2 ) and the mean difference (Δ) are the following:

μ = 0 , (17)

σ 2 = g g 2 , (18)

Δ = g B [ ( g 1 ) / 2 , ( g + 1 ) / 2 ] ( 2 g 1 ) B ( g , g ) B ( g / 2 , g / 2 ) 2 2 g 3 . (19)

Using the Lomnicki’s formula we came to the following expression of the mean difference variance for the Student’s distributive model:

var ( Δ ¯ ) = 4 g ( n + 1 ) 3 ( g 2 ) n ( n 1 ) + 2 5 2 g g Γ [ g 1 / 2 ] 2 Γ [ ( g 1 ) / 2 ] 2 ( 3 2 n ) Γ [ g / 2 ] 6 n ( n 1 ) + 8 3 g Γ [ ( g 1 ) / 2 ] 2 h ( g ) ( n 2 ) π ( g 2 ) Γ [ g / 2 ] 2 n ( n 1 ) (20)

in which

h ( g ) = Γ ( g / 2 1 / 3 ) Γ ( g / 2 + 1 / 3 ) Γ ( g / 2 1 / 6 ) Γ ( g / 2 5 / 6 ) (21)

It is easily checked that, as g diverges,

var ( Δ ¯ ) = 72 48 3 + 4 π ( 48 + 24 3 + 4 π ) n 3 π n ( n 1 ) , (22)

that represents the sample difference variance for the normal model. It is also easily verified that, as n diverges, the above mentioned variance approaches zero, which means that Δ ¯ is also a consistent estimator.

5. Conclusions

The sample mean difference Δ ¯ is a correct estimator of the mean difference population for every distributive model. To verify if it is also consistent or not we need to calculate its variance, in this paper we have obtained the variance of Δ ¯ formal expression for the Student’s distributive model in terms of the parameter g (degrees of freedom) and of the sample size n.

Because, even for the Student’s distributive model, such variance approaches zero as the sample size n diverges, Δ ¯ results consistent. As g diverges the Student’s distributive model tends to the normal one. As a matter of fact the variance of Δ ¯ expression we found approaches the variance of Δ ¯ for the normal distributive model.

Acknowledgements

The helpful and constructive comments of a referee which lead to an improvement of the presentation of the paper and support from the editorial staff of Open Journal of Statistics to process the paper are all gratefully acknowledged.

Conflicts of Interest

The authors declare no conflicts of interest regarding the publication of this paper.

References

[1] Campobasso, F. (2007) Alcuni risultati sulla varianza della differenza media camp- ionaria. Annals of the Department of Statistical Sciences, 6.
[2] Nair, U.S. (1936) The Standard Error of Gini’s Mean Difference. Biometrika, 28, 428-436.
https://doi.org/10.2307/2333957
[3] Lomnicki, Z.A. (1952) The Standard Error of Gini’s Mean Difference. Annals of Mathematical Statistics, 23, 635-637.
https://doi.org/10.1214/aoms/1177729346
[4] Girone, G. and Mazzitelli, D. (2007) La differenza media nei principali modelli distributivi continui. Annals of the Department of Statistical Sciences, 6.

Copyright © 2024 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.