Paper Type |
: |
Research Paper |
Title |
: |
SERA Noise Estimation for Speech Enhancement |
Country |
: |
India |
Authors |
: |
B. Ravi Teja, Dr.B. Ananda Krishna |
 |
: |
10.9790/2834-0214046  |
Abstract: A practical single channel speech enhancement system consists of two major components, estimation of noise power spectrum and the estimation of speech. Therefore, a crucial component of any algorithm is the estimation of the noise power spectrum for highly non stationary noise environments. The performance of noise estimation algorithm is usually a tradeoff between speech distortion and noise reduction. In existing methods, noise is estimated only during speech pauses and these pauses are identified using Voice Activity Detector (VAD). This paper describes novel noise estimation method SERA (Spectral Entropy Recursive Averaging) to estimate noise in highly non stationary noise environments. In SERA, noise estimation is updated in both speech pauses and also speech present frames. Speech presence is determined by computing the ratio of the noisy speech power spectrum to its local minimum, which is computed by averaging past values of the noisy speech power spectra with a look-ahead factor. Environmental noise is present in all the frames of the noisy speech signal and if the speech/silence detection is not accurate, then it yields speech echoes and residual noise in the enhanced speech. In this paper, noise estimation is updated by dividing speech signal into pure speech, quasi speech and non-speech frames based on adaptive multiple thresholds without using of VAD. The proposed method is compared with weighted average noise estimation method in terms of segmental SNR. The simulation results of the proposed algorithm shows better performance over a system that uses VAD in noise estimation.
Keywords: Entropy, Noise Estimation, Quasi Speech, SERA, Smoothing Constant, Speech Enhancement, Voice Activity Detector (VAD)
[1]. R.SundarRajan and C.L.Philipos, "A Noise Estimation Algorithm for Highly Non-stationary environments," speech communication, Vol.48, PP.220-231, 2006.
[2]. Ch.V.RamaRao, "Noise Estimation for Speech for Enhancement in Non-stationary environments – a new method", World Academy of Science, Engineering and Technology, Vol.70, PP.739-740, 2010.
[3]. T.Lalith Kumar and R.SundarRajan, "Speech Enhancement using Adaptive Filters", VSRD-JJEEE, Vol.2 (2), PP.92-99, 2012.
[4]. C.GaneshBabu and P.T.Vanathi, "Performance Analysis of Voice Activity Detection Algorithm for Robust SpeechRecognition System under Different Noisy Environment", Journal of Scientific & Industrial Research, Vol.69, PP.515-522, July 2010.
[5]. G.Poblinger, "Computationally Efficient Speech Enhancement by Spectral Minima Tracking in Subbands", Proc. Euro Speech 2, PP.1513-1516, 1995.
[6]. R.Martin, "Noise Power Spectral Density Estimation Based on Optimal Smoothing and Minimum Statistics", IEEE Trans speech Audio Process, PP-504-512, 2001.
[7]. P.Loizou, R.Sundarajan and Huy, "Noise Estimation Algorithm with Rapid Adaption for Highly Non-stationary Environments", prec. IEEE international conference on acoustic speech signal Proc, 2004.
[8]. J.Sohn and N.Kim, "Statistical Model Based Voice Activity Detection", IEEE signal ProcLett, 6(1), PP-1-3, 1999.
[9]. S.Tanyer and H.Ozer, "Voice Activity Detection in Non – Stationary Noise", IEEE Speech Audio Proc. 8(4), PP.478-482, 2000.
[10]. P.Loizou, "A Noise Estimation Algorithm with Rapid Adaption for Highly Non-stationary environments, Speech Communication Science direct, PP-220-231, 2006.
[11]. Anu Radha and R.Fuknu, "Noise Estimation Algorithms for Speech Enhancements in Highly Non-stationary Environments", IJCSI, Vol.8, PP.39-44, 2011.
[12]. Cohen, I. and Berdugo, B., "Noise Estimation by Minima Controlled Recursive Averaging for Robust Speech Enhancement," IEEE Signal Proc. Letters, vol. 9, no. 1, pp. 12-15, Jan. 2002.
[13]. Hirsch, H. G. and Ehrlicher, C., "Noise Estimation Techniques for Robust Speech Recognition," in Proc. 20th IEEE Int. Conf. Acoustics, Speech, Signal Processing, Detroit, MI, pp. 153-156, May 8-12, 1995.