Distribution of the Sample Correlation Matrix and Applications

Abstract

For a multivariate normal population that does not have null correlations, we give the exact expression of the distribution of the sample correlation matrix R, with the sample variances acting as parameters. In the null-correlation case, the distribution of its determinant is also established, in terms of Meijer G-functions. Several numerical examples are given, and applications to the concept of system dependence in Reliability Theory are presented.
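
As an informal illustration of the null-correlation case, the following Python sketch simulates the determinant of R for independent normal variables and compares its Monte Carlo mean with the mean implied by a standard textbook result: under null correlations and with N observations on p variables, |R| is distributed as a product of independent Beta((N - i)/2, (i - 1)/2) variables for i = 2, ..., p, so its mean is the product of (N - i)/(N - 1). This is not the paper's Meijer G-function expression; the function names `simulate_det_R` and `null_mean_det_R` and the chosen values of N and p are assumptions made only for this example.

```python
import numpy as np


def simulate_det_R(n_obs, p, n_rep=5000, seed=0):
    """Monte Carlo draws of det(R) for p independent standard normal variables."""
    rng = np.random.default_rng(seed)
    dets = np.empty(n_rep)
    for k in range(n_rep):
        x = rng.standard_normal((n_obs, p))   # rows are observations
        r = np.corrcoef(x, rowvar=False)      # p x p sample correlation matrix
        dets[k] = np.linalg.det(r)
    return dets


def null_mean_det_R(n_obs, p):
    """Mean of det(R) under null correlations, from the beta-product form.

    Each factor Beta((N - i)/2, (i - 1)/2) has mean (N - i)/(N - 1).
    """
    i = np.arange(2, p + 1)
    return float(np.prod((n_obs - i) / (n_obs - 1)))


# Illustrative (assumed) settings: N = 20 observations, p = 3 variables.
dets = simulate_det_R(n_obs=20, p=3)
print("simulated mean of det(R):", dets.mean())
print("beta-product mean       :", null_mean_det_R(20, 3))
```

The two printed values should agree up to Monte Carlo error; a histogram of `dets` gives a rough picture of the density that the paper expresses in closed form.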

Share and Cite:

Pham-Gia, T. and Choulakian, V. (2014) Distribution of the Sample Correlation Matrix and Applications. Open Journal of Statistics, 4, 330-344. doi: 10.4236/ojs.2014.45033.

Conflicts of Interest

The authors declare no conflicts of interest.

Copyright © 2023 by authors and Scientific Research Publishing Inc.

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.