Iterative spectral subtraction method for millimeter-wave conducted speech enhancement
Sheng Li, Jian-Qi Wang, Ming Niu, Xi-Jing Jing, Tian Liu
.
DOI: 10.4236/jbise.2010.32024   PDF    HTML     5,692 Downloads   10,649 Views   Citations

Abstract

A non-air conducted speech detecting method has been developed in our laboratory by using millimeter wave radar technology. Because of the special attributes of the millimeter wave, this method may considerably extend the capabilities of traditional speech detecting methods. However, radar speech is substantially degraded by additive combined noises that include radar harmonic noise, electrocircuit noise, and ambient noise. This study, therefore, proposed an iterative spectral subtraction method which can be adaptively estimate noise spectrum at every iteration, and reduce the musical noise remained in the previous spectral subtraction process. Results from simulations as well as evaluations confirm that the proposed method satisfactorily reduces whole-frequency and musical noises and produces good speech quality.

Share and Cite:

Li, S. , Wang, J. , Niu, M. , Jing, X. and Liu, T. (2010) Iterative spectral subtraction method for millimeter-wave conducted speech enhancement. Journal of Biomedical Science and Engineering, 3, 187-192. doi: 10.4236/jbise.2010.32024.

Conflicts of Interest

The authors declare no conflicts of interest.

References

[1] Li, S., Scherer, R.C., Wan, M., Wang, S. and Wu, H. (2006) The effect of glottal angle on intraglottal pressure. Journal of the Acoustical Society of America, 119(1), 539–548.
[2] Li, S., Scherer, R.C., Wan, M., Wang, S. and Wu, H. (2006) Numerical study of the effects of inferior and superior vocal fold surface angles on vocal fold pressure distributions. Journal of the Acoustical Society of America, 119(5), 3003–3010.
[3] Yanagisawa, T. and Furihata, K. (1975) Pickup of speech signal utilization of vibration transducer under high ambient noise. J. Acoust. Soc. Jpn, 31(3), 213–220.
[4] Li, Z.-W., (1996) Millimeter wave radar for detecting the speech signal applications. International journal of Infrared and Millimeter Waves, 17(12), 2175–2183.
[5] Holzrichter, J.F., Burnett, G..C. and Ng, L.C. (1998) Speech articulator measurements using low power EM-wave sensors. J. Acoust. Soc. Am, 103(1), 622–625.
[6] Hu, R. and Raj, B. (2005) "A robust voice activity detector using an acoustic Doppler radar," in IEEE Workshop on Automatic Speech Recognition and Understanding, 319–324.
[7] Quatieri, T.F., Brady, K., Messing, D. and Campbell, J.P., (2006) Exploiting nonacoustic sensors for speech encoding. IEEE Transactions on Audio, Speech and Language Processing, 14(2), 533–544.
[8] Boll, S.F. (1979) Suppression of acoustic noise in speech using spectral subtraction, IEEE Transactions on Acoustics, Speech and Signal Processing, 27, 113–120.
[9] Lockwood, P. and Boudy, J. (1992) Experiments with a nonlinear spectral subtractor (NSS), hidden Markov models and projection, for robust recognition in cars. Speech Commun, 11, 215–228.
[10] Hansen, J.H.L., (1994) Morphological constrained feature enhancement with adaptive cepstral compensation (MCE-ACC) for speech recognition in noise and Lombard effect. IEEE Trans. Speech Audio Process, 2, 598–614.
[11] Liu, H., Zhao, Q., Wan, M. and Wang, S. (2006) Application of spectral subtraction method on enhancement of electrolarynx speech. J. Acoust. Soc. Am, 120(1), 398–406.
[12] Kamath, S. and Loizou, P. (2002) A multi-band spectral subtraction method for enhancing speech corrupted by colored noise. IEEE International Conference on Acoustics, Speech, and Signal Processing, 4, 4160–4164.
[13] Udrea, R.M., Ciochina, S. and Vizireanu, D.N. (2005) Multi-band bark scale spectral over-subtraction for colored noise reduction. International Symposium on Signals, Circuits and Systems, 1, 311–314.
[14] Wang, J.Q., Zheng, C.X., Jin, X.J. and Lu, G..H, (2004) Study on a non-contact life parameter detection system using millimeter wave. Space Medicine & Medical Engineering, 17(3), 157–161.
[15] Wang, J., Zheng, C., Lu, G.. and Jing, X. (2007) A new method for identifying the life parameters via radar. EURASIP Journal on Advances in Signal Processing, 2007(1), 8–16.
[16] Berouti, M., Schwartz, R. and Makhoul, J. (1979) Enhancement of speech corrupted by acoustic noise, Proc. IEEE Int. Conf. Acoust., Speech, Signal Process, 208–211.
[17] Lim, J.S. and Oppenheim, A.V. (1978) All-Pole Modeling of Degraded Speech. IEEE Trans. Acoust. Speech, Signal Processing, ASSP, 26, 197–210.
[18] Ogata, S. and Shimamura, T. (2001) Reinforced spectral subtraction method to enhance speech signal, Electrical and Electronic Technology. TENCON. Proceedings of IEEE Region 10 International Conference, 1, 242–245.
[19] Cohen, I. and Berdugo, B. (2002) Noise estimation by minima controlled recursive averaging for robust speech enhancement. IEEE SIGNAL PROCESSING LETTERS, 9(1), 12–15.
[20] Cohen, I. and Berdugo, B. (2001) Speech enhancement for non-stationarynoise environments. Signal Processing, 81, 2403–2418.
[21] Rangachari, S. and Loizou, P. C. (2006) A noise-estimation algorithm for highly non-stationary environments. Speech Communication, 48, 220–231.

Copyright © 2024 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.